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Abstract 

We show how many contemporary issues in event generation can be recast in terms of partonic 
calculations with a matching scale. This framework is called GenEvA, and a key ingredient is a new 
notion of phase space which avoids the problem of phase space double-counting by construction 
and includes a built-in definition of a matching scale. This matching scale can be used to smoothly 
merge any partonic calculation with a parton shower. The best partonic calculation for a given 
region of phase space can be determined through physics considerations alone, independent of 
the algorithmic details of the merging. As an explicit example, we construct a positive-weight 
partonic calculation for e~^e~ — > n jets at next-to-leading order (NLO) with leading-logarithmic 
(LL) resummation. We improve on the NLO/LL result by adding additional higher-multiplicity 
tree-level (LO) calculations to obtain a merged NLO/LO/LL result. These results are implemented 
using a new phase space generator introduced in a companion paper Q. 
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1. INTRODUCTION 



Despite facing a complicated detector environment, top quark p[ and W boson p, ^ 
studies at the Tevatron have proved that it is nevertheless possible to make precision mea- 
surements at hadron colliders. With the upcoming Large Hadron Collider (LHC), a complete 
understanding of Standard Model backgrounds will be essential for discovering new physics 
at the energy frontier 0]. Therefore, precision theoretical calculations are needed to 

complement the increasingly sophisticated experimental techniques available at hadron col- 
liders. At lepton colliders, data is often compared to theoretical predictions for inclusive 
quantities, but at hadron colliders, it is more typical for data to be compared to theoretical 
predictions for exclusive quantities in order to more readily apply experimental cuts that 
may not be well defined in an inclusive theoretical framework. 

Monte Carlo programs have proved indispensable for making exclusive theoretical predic- 
tions. Together with parton distribution functions, hadronization models, and underlying 
event models, traditional event generators [|, |, |lU|, |Tl|, ^ H, [l^, H H 



agree remarkably well with Tevatron data over a wide variety of experimental observ- 
ables However, in anticipation of further theoretical progress in refining Stan- 

dard Model (and Beyond the Standard Model) predictions, it is worthwhile to consider 
possible improvements to the traditional Monte Carlo approach. There has been much 
work in recent years on merging fixed-order matrix element calculations with parton show- 

\, EB. Furthermore, there are several programs 



ers 



, 28, 29, 30, 31, 32, 33 



which implement higher-order calculations to produce inclusive cross sections [36, 37, 38, 39 



though these next-to-leading (NLO) order programs are not cast in the form of a Monte 
Carlo program that can generate fully-hadronized exclusive events. While several ideas 
exist in the literature of how to implement a combination of NLO results and parton show- 

m 0, 



U B3, p3, only a few publicly available 



ers 



40 



45 



programs currently exist |53, E^ that implement this properly. 



At the level of partons, current event generators are based on two independent frame- 
works: fixed-order matrix element calculations and parton showers. While fixed-order cal- 
culations include quantum interference and can be systematically improved through pertur- 
bative loop calculations, parton showers are always necessary to generate additional QCD 
radiation in a Monte Carlo framework. This is because fixed-order calculations cannot han- 
dle the large number of final states typically present in high-energy collisions. Schematically, 
the way the fully differential hadronic cross section da is achieved in a traditional approach 
is through 

da*'""'^- = MC {\M\^d<!>) , (1) 

where |A^p represents a fixed-order QCD calculation, d$ represents a fixed-multiplicity 
phase space algorithm, and MC represents the action of a showering/hadronization scheme. 
The main challenge of Eq. (|l|) is that the actual partonic four-momenta are generated both 
through the action of d$ and MC, and in order to avoid phase space double-counting, either 
I^Wp has to be modified from the value calculated in QCD, or the action of MC has to 
modified to accommodate the fixed-order calculation, or both. 

In this paper, we present a new Monte Carlo framework GenEvA — for Generate Events 
Analytically — that allows almost any parton-level calculation to be translated into hadron- 
level events in a generic way with only a minimal modification of field theoretic methods.^ 



^ This framework is unrelated to the "Geneva" jet algorithm (56| 
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The key idea is to separate the physics considerations, which determine the appropriate 
distribution to use, from the algorithmic details, which define how phase space is generated 
and what models are used to perform additional showering and hadronization. As we will 
see, it is possible to separate the physics and algorithmic issues using a matching scale /i. 
Schematically, the fully differential hadronic cross section da in GenEvA is given by 

da'^^-E^* = |A<(/i)pdMC(/i). (2) 

The quantity dMC(/i) represents both a phase space algorithm and a shower- 
ing/hadronization scheme that starts generating radiation at the scale /i. The only informa- 
tion the "partonic calculation" |A1(yu)p needs to know about the specific implementation of 
phase space generation and showering is this matching scale fi. From the point of view of 
the partonic calculation, fi is an infrared scale and can be thought of as the scale at which 
the partonic calculation is interfaced with a parton shower. We will see that the details of 
this matching are independent of any specific parton shower algorithm. While the partonic 
calculation does not necessarily correspond to a traditional QCD amplitude,^ we 

will show various ways that |A^(yu)p can be determined solely in the context of perturbative 
QCD, independent of the algorithmic details of dMC(/i). 

In this way, GenEvA allows the user to focus on determining the most accurate differential 
cross section in a given region of phase space, instead of trying to figure out an algorithm 
which not only covers phase space with the right distributions, but also interfaces cleanly 
with a showering/hadronization scheme. As a concrete example of the power of the GenEvA 
framework, we will present a conceptually simple implementation of two known methods to 
improve the accuracy of Monte Carlo: parton shower /matrix element merging (PS/ME) ||2^, 
0, |28|, gl, |3g, |T], 0, H, g, m and Monte Carlo at next-to-leading order 



(PS/NLO) ig, |T], H, H, |§, ^ 1^, 0, In previous approaches, 



PS/ME merging and PS/NLO merging relied on different Monte Carlo algorithms, but in 
the context of GenEvA, they correspond just to different choices for |7Vl(yu)p and share the 
same algorithmic underpinnings. 

In fact, GenEvA allows PS/ME merging and PS/NLO merging to be combined to create 
a positive-weight Monte Carlo sample that merges NLO information with higher-order tree- 
level (LO) matrix elements and leading-logarithmic (LL) resummation. The simplicity with 
which we achieve an NLO/LO/LL merged sample through a special choice for 
suggests obvious generalizations to including next-to-next-to-leading order (NNLO) or next- 
to-leading logarithmic (NLL) information in event generators, though we will not pursue 
those directions in the present work. 

Our focus will be on getting the most accurate distribution possible using available theo- 
retical tools, and not necessarily on generating these distributions efficiently. In a companion 



paper ||T|, we describe a dMC(/i) generator based on parton shower reweighting and 
we will see that certain gains in efficiency come as a bonus for using the parton shower as 
a phase space generator. However, the GenEvA framework is more general than the specific 



^ In the context of soft-coUinear effective theory (SCET) |5|, |5^, corresponds to the square 

of a Wilson coefficient and dMC(/i) corresponds to the matrix element of an SCET operator |62[ . 
This explains why there should be a cancellation of the fi dependence in |A^(/i)pdMC(/i). While SCET 
offers formal definitions for what |A^(/i)p, dMC(/i), and /i are, these formal definitions are not necessarily 
required. 
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algorithmic implementation in Ref. |T[]. Though we anticipate extending GenEvA to han- 
dle the full range of Standard Model processes at hadron colliders, we will focus here on 
the process e~^e~ n jets, which is sufficiently complicated to highlight all of the novel 
techniques introduced by GenEvA. In the companion paper [|I|, we comment on additional 
technical issues that arise in trying to understand Tevatron or LHC physics. 

In the next section, we review the challenges faced when constructing Monte Carlo tools 
and the insights offered by GenEvA. In Sec. 0, we discuss the GenEvA framework in a simple 
toy model. We then transition to the more complicated case of QCD, where we discuss the 
first emission in detail in Sec. |^ and extend the results to multiple emissions in Sec. |^. After 
showing results from the GenEvA program in Sec. we conclude in Sec. |^. 

2. OVERVIEW OF THE GENEVA FRAMEWORK 
A. The Challenge of QCD 

To build a perfect Monte Carlo program, one would need a complete description of the 
Standard Model that is valid over all of phase space. Unfortunately, no such description 
exists, mainly because we do not have a complete description of QCD that is valid for every 
energy scale and every kinematic configuration. Instead, we know various limits of QCD, 
various perturbative expansions of QCD, and various phenomenological models based on 
QCD. The best Standard Model Monte Carlo program we can hope to build is one that 
coherently combines as many descriptions of QCD as possible. 

One crucial combination that has been the subject of many recent advances in Monte 
Carlo is the merging of fixed-order calculations with parton showers. Fixed-order calcu- 
lations are reliable when pairs of partons are well separated in phase space, while parton 
showers are reliable in the soft-coUinear limit. For this reason, a Monte Carlo program 
that combines both descriptions has a better chance to correctly describe experimental data 
over a broad range of observables and energy scales. The problem with the language of 
"fixed-order calculation" and "parton shower" is that these terms convolute calculational 
definitions with algorithmic ones, and the goal of the GenEvA framework is to isolate the 
calculational from the algorithmic challenges involved in combining different descriptions of 
QCD. 

As shown in Fig. ||, a fixed-order calculation is based on a perturbative expansion of 
QCD to fixed order in as- To distribute events according to a fixed-order calculation, 
one usually uses a non-Markovian algorithm (such as an adaptive grid) to generate points 
in fixed-multiplicity n-body phase space d$„. The parton shower, on the other hand, is 
defined in the soft-coUinear limit of QCD, and the splitting functions and Sudakov factors 
are calculated in this limit. Parton showers are usually used in the context of a Markovian 
phase space algorithm that recursively generates all of variable-multiplicity phase space 
through a probabilistic mapping of d$„ —>■ d$„+i.^ Therefore, in trying to combine a fixed- 
order calculation with a parton shower, one is simultaneously trying to merge two different 
expansions of QCD and two different phase space algorithms. Indeed, currently there does 
not exist a solution for how to merge generic fixed-order calculations with parton showers, 
mainly because separate algorithmic merging procedures are currently necessary depending 



^ More general parton showers such as in Refs. ^ can also have non-Markovian aspects. 
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FIG. 1: The traditional approach to merging fixed-order calculations with parton showers. Tradi- 
tionally, fixed-order calculations tie a perturbative expansion to a non-Markovian fixed-multiplicity 
phase space algorithm. The parton shower ties a kinematic expansion to a Markovian variable- 
multiplicity phase space algorithm. Hence, in trying to merge fixed-order calculations with parton 
showers, one is led to simultaneously having to merge two different QCD expansions and two dif- 
ferent algorithmic methods to generate phase space. On the algorithmic side, imposing a phase 
space cut /i to separate the two different algorithms would results in residual /i dependence and 
uncanceled infrared divergences as explained in the text. GenEvA is based on imposing a calcula- 
tional fx cut that separates QCD calculations from QCD phenomenological models. This requires 
splitting the parton shower into a formal component and a phenomenological component. 



on whether one is considering tree-level calculations or one-loop calculations. 

Naively, one might hope to define an algorithmic phase space cut fi to separate fixed- 
order calculations from parton showers as on the right side of Fig. One could imagine 
using a fixed-order calculation when the invariant mass between two partons is greater than 
fi and using a parton shower when the invariant mass is less than /i. This algorithmic 
fi would immediately solve the problem of double-counting that arises when the parton 
shower acting on n-body phase space covers exactly the same phase space regions as an 
(m > n)-body generator. However, this approach is suspect for two reasons. First, the 
parton shower resums the leading Sudakov logarithms in the problem, so if the parton 
shower is only used below /x, it will exhibit double-logarithmic as log^ /i sensitivity to this 
arbitrary, unphysical /i scale. Second, infrared divergences in virtual diagrams contributing 
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to the fixed-order results are canceled by collinear and/or soft real-emission diagrams, which 
are by definition contributing below the scale /i and are therefore contained in the parton 
shower picture. Thus, one has to somehow link the fixed-order calculation to the parton 
shower in order to properly cancel infrared divergences. 

We argue that a better definition of /x is as a calculational scale that separates calcu- 
lations performed in QCD from phenomenological models based on QCD as on the left 
side of Fig. |I]. This requires cleanly separating two different uses of the parton shower. 
Above the scale n, the parton shower corresponds directly to quantities that can be defined 
formally in the soft-coUinear limit. In particular, the splitting functions p3| , |H7[] and Su- 



dakov factors |Q can be systematically derived [|61| , |62| using soft-coUinear effective theory 
(SCET) fS^, |6^. Below the scale /i, the parton shower should be regarded as simply 

a QCD-inspired phenomenological model. For example, a phenomenological shower can be 
used to extrapolate into regions of phase space away from the soft-collinear limit if high- 
multiplicity fixed-order matrix elements are unavailable, creating partonic final states that 
include the correct singularity and symmetry structure of QCD, but lack the full quantum 
interference. Similarly, a phenomenological shower can serve as the entry point to nonper- 
turbative hadronization in a general fragmentation scheme, and in that context, the shower 
can and should be tuned to data to reproduce the measured fragmentation properties seen 
in experiments. 

It is the fact that the parton shower has meaning above and below /i that can be used 
to mitigate the unphysical fi dependence, since it implies that the double-logarithmic de- 
pendence of the parton shower is identical to the double-logarithmic dependence of QCD. 
This guarantees that partonic calculations with leading-logarithmic improvements will have 
no leading fi dependence when interfaced with a parton shower at the scale fi. However, 
the challenge of defining fi in this way is that now it is not clear whether the issue of phase 
space double-counting will be solved, nor is it obvious that such a definition of /i will pro- 
vide a method to combine generic fixed-order calculations with a parton shower, especially 
in the presence of infrared divergences. At this point, it is thus worthwhile to reconsider 
the various components of Fig. |l| to distill the essential challenges in combining fixed-order 
calculations with parton showers. 



B. Combining Different QCD Descriptions 

In trying to combine different descriptions of QCD into a coherent Monte Carlo frame- 
work, there are really three definitions of what is meant by "combine" as shown in Fig. |^: 
combining different formal expansions, combining formal calculations with phenomenologi- 
cal models, and combining different phase space algorithms. These definitions of "combine" 
are in one-to-one correspondence with three technical problems that arise when trying to 
merge fixed-order calculations with parton showers: regulating infrared divergences, cancel- 
ing unphysical /i dependence, and eliminating double-counting. The power and simplicity 
of the GenEvA framework comes from cleanly separating these three issues. 

The first meaning of "combine" is to merge different formal expansions of QCD, in our 
case a perturbative expansion in as, with a kinematic expansion that allows resumming 
logarithms of some ratio r of energy scales. A successful combination of this form will result 
in a partonic calculation |A^(/i)p that is simultaneously correct to the calculated order in 
as while also including the desired level of logarithmic resummation. While there are formal 
procedures to calculate by matching QCD onto SCET [^, |62|, there are numerous 



7 



Pert ur bat ive 
Expansion 



a 
o 

"3 

o 
O 



merged with 

Partonic 
Calculation 



Kinematic 
Expansion 



merged with 

Matching 
Scale 



Phenomenological 
Model 



Non-Markovian 
Algorithm 



merged with 

GenEvA 

Algorithm 



Markovian 
Algorithm 



/i 



o 
53 



FIG. 2: The three different meanings of "combine": (T) Combining different expansions of QCD in 
a partonic calculation; (2) combining a partonic calculation with a phenomenological model; and 
(3) combining a non-Mar kovian phase space generator with a Markovian generator. By isolating 
the issues involved in merging fixed-order calculations with parton showers, GenEvA offers a simple 
and generic method to improve Monte Carlo. Note that all these meanings of "combine" respect the 
division between calculations on one side and algorithms on the other, and the GenEvA framework 
is devised to cleanly separate the calculational and algorithmic issues. 



correct choices for that satisfy the desired properties at the leading-logarithmic 

level. We will also see that the issue of infrared divergences is solved entirely in the context 
of a partonic calculation, because one can think of the cancellation of divergences between 
loop diagrams and tree diagrams as just another type of formal combination. 

We have included an explicit /i dependence in the partonic calculation in antic- 

ipation of the second meaning of "combine" , which is to combine a formal QCD calculation 
with a phenomenological model based on QCD. Below some scale /x, the phenomenological 
parton shower will be used to fill out phase space and interface with a nonperturbative 
hadronization model, but we already saw that this will introduce unphysical ag log^ jj, de- 
pendence in the final results. However, if we include leading-logarithmic resummation in 
the partonic calculation |A^(//)P, then this /i dependence will cancel by construction. At 
first glance, it appears that the partonic calculation now needs to know the details of the 
phenomenological shower in order to engineer the as log^ /i cancellation. Remember though, 
that this double-logarithmic /i dependence is a property of QCD and not of a specific par- 
ton shower, so the correct /j, dependence can be included in independently of the 
details of the shower. The only requirement is that the same formal definition of ^ is used 
in both the partonic calculation and the phenomenological shower. In practice, the choice 
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of /i corresponds to the choice of shower evolution variable. 

Finally, the third meaning of "combine" has to do with merging different algorithmic tech- 
niques. We saw that phase space can either be covered by a fixed-multiplicity non-Markovian 
algorithm or with a variable-multiplicity Markovian algorithm, and for computational rea- 
sons we may wish to use different algorithms in different regions of phase space. Because of 
the problem of double-counting, one needs to merge these two algorithms in such a way that 
a given point in phase space is covered once and only once. One option is to define a phase 
space cut /i, as mentioned already, and use a non-Markovian algorithm above that scale n 
and a Markovian algorithm below fi. We denote such phase space with a matching scale as 
dMC(/i), and an example of an efficient dMC(/i) generator is presented in the companion 
paper 

Let us return to the traditional approach in Eq. (P for creating a fully differential hadronic 
cross section, now with the final state multiplicity n made manifest: 

dor*-<^- = ^MC^„(|^„|2d$„) . (3) 

n 

Here, |A^nP is a fixed-order calculation with n final state partons, d<I>„ is n-body phase 
space, and MC^„ represents a phenomeno logical parton shower that starts at the scale fin- 
As mentioned already, current approaches to merging fixed-order calculations with parton 
showers require either a modification to to MC^,^, or to both, and GenEvA is no 

exception. The novelty of GenEvA is that these modifications can be made completely 
generic by applying the logic of Fig. |^. 

First, a fixed-order calculation lacks the correct behavior for small r, so we have to 
substitute a partonic calculation that includes a proper logarithmic resummation, 

® \Mn\'-^\MM\\ 

where at this point fin is an unspecified infrared scale. Second, to avoid large log^ Hn 
dependence when the phenomenological parton shower is applied, the infrared scale in the 
partonic calculation must be the same as the starting scale of the shower, 

(2) fin^ fin- 

Finally, the traditional approach to phase space involves two different phase space algo- 
rithms, a non-Markovian d^n generator and a Markovian MC^„ generator. For any specific 
phase space point, the partonic calculation is simply a number that is unchanged by the 
action of MC^„, so |A^n(Ain)P can be factored out separately for each n. The quantity 
J2n (d$„) involves multiple covering of phase space, since the action of the phenomeno- 
logical model starting from a phase space point in d$„ will populate regions of phase space 
d$m>n by the splitting of particles. To solve the problem of double-counting we introduce 
a /i-aware phase space generator 

(3) MC^„(d$„)^dMC„(/i„), 

which cleanly separates phase space populated by the d$„ generator and the subsequent 
evolution using the phenomenological model. These three steps lead to the GenEvA master 
formula from Eq. (|^) 

da™ = Y,\^n{ftn)\'dMCnifin) - (4) 
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FIG. 3: A schematic summary of the GenEvA master formula. The fully exclusive differential cross 
section comes from interfacing a partonic calculation with a phenomenological model 

through the use of Monte Carlo space dMC(/i). The matching scale fi allows this interface to be 
smooth and defines a clean separation between calculational and algorithmic issues. 
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FIG. 4: The user's view of the GenEvA framework. Partonic calculations as defined by 
are used above a universal scale fi and can be distributed with the GenEvA algorithm Q. A phe- 
nomenological model is used below the scale /i and is implemented using an unspecified algorithm, 
usually a parton shower interfaced with a hadronization scheme. Because the same scale // defines 
both the calculational separation and the algorithmic separation, the user is free to choose the best 
partonic calculations from physics considerations alone, without having to worry about the details 
of the algorithmic implementation. 



There are two interesting features of Eq. (^). First, as summarized in Fig. |^, this formula 
can be interpreted as an explicit separation of calculational issues from algorithmic issues. 
|-^n(/^n)P certainly encodes the challenges of merging together different QCD expansion 
schemes into a single partonic cross section. dMC„(yU„) is slightly more complicated because 
it encodes both algorithmic information as well as the physics of phenomenological models. 
However, to the extent to which we (unfairly) regard phenomenological models as QCD- 
inspired numerical algorithms, dMCn(/iri) encodes the algorithmic challenges of distributing 
QCD-like events. Because calculational issues are now separated from algorithmic ones, 
the partonic calculations |A^„(/i„)P can be determined through physics consideration alone, 
independently of the details of dMC„(/i„). 

Second, as summarized in Fig. ^, the same scale fin appears both in |A1n(Ain)P and in 
dMC„(/i„). A potential mismatch as in Fig. ^ between the calculational scale and the 
algorithmic scale /i is avoided because these two scales are now forced to be the same to 
cancel the as log^ fx dependence. In this way, all the user needs to know is that full QCD 
information as encoded in |A^(/i)p will be used above /z, and phenomenological models of 
QCD (which the user need not specify) will be used below fi. Of course, the precise definition 
of n must always be specified, and in this paper, we will use the invariant mass between 
particles to define fi, leaving a discussion of generaUzations for the companion paper [|1|] . 
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FIG. 5: The scale-dependent phase space d$„,({/ij}j<„). The regions of d<I>3^4 labeled /i2 are those 
reached by the phenomenological model acting on d<I>2 with starting scale fi2- The region of d$4 
labeled /U3 is reached by the phenomenological model acting on d<I>3 when started at the scale 
H3. Excluding these regions from d$3^4 defines d$3(/i2) and d<&4(/i2, /^s)- When interfaced with 
a phenomenological model, the set of all d<I>„({;Uj}j<„) regions gives a complete covering of phase 
space with no double-counting by construction. 

In the remainder of this section, we will consider the various elements in Eq. (§) in more 
detail, starting with a more precise definition of the phase space dMC(/i), explaining how 
to include additional flexibility in defining more scales like n, and then discussing methods 
to determine the partonic calculation |A^(/i)p. 

C. A New Approach to Phase Space 

As discussed in the previous section, the GenEvA framework requires a phase space gen- 
erator dMC{fi) that, by construction, does not introduce double-counting when interfaced 
with a phenomenological model for filling out phase space. To be specific, we take the phe- 
nomenological model to be a parton shower that takes an n-body configuration in d$„ 
and starts showering at some scale /i„ to produce additional partons. In other words, the 
parton shower produces m-body configurations with m > n, where the precise regions of 
higher-dimensional phase space that are populated are determined by the choice of starting 
scale fin and the details of the parton shower. Calling these regions MC„(/i„), we can think 
of the parton shower as defining a map 



To eliminate double-counting, we simply have to exclude those regions from d$„ that 
can be reached by acting the shower on d$j starting from the scale /Zj for any i < n. As 
illustrated in Fig. ^, the remaining parts of d$„ define phase space with a matching scale 
d$„({/ii},<„).^ Note that d$„({/ii} i^n) is a function of all the lower- dimensional matching 




(5) 



^ In Fig. 1^, d$4(/i2, Ms) strictly speaking only depends on fj,^ since the region of d<l>4 mapped out by fj,2 lies 
completely inside 113. In general, this need not be the case, see for example Fig. |ll|. 
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FIG. 6: Definition of "Monte Carlo space" dMCnifJ^n) as all regions of scale-dependent phase space 
d$„({/ij}i<„) that can be reached by the phenomenological model starting from dMC2(/i2) 
consists of d$2 together with the part of d$3^4^... labeled by fi2 in Fig. |5|, dMCs{fis) consists of 
d$3(//2) together with the part of d<I>4^... labeled by fis (but excluding that labeled by fi2), and so 
on. This way of organizing phase space emphasizes that while partonic calculations are defined on 
scale-dependent phase space d^'(^), they affect all regions of Monte Carlo space dMC(/u) through 
the phenomenological model. 

scales {/ii}i<n and is independent of the matching scale 

Having solved the issue of double-counting, it is now more convenient to directly talk 
about "Monte Carlo space" dMCn({/ij}j<„), which is defined as d$„({/ij}j<„) plus the col- 
lection of all relevant regions excluded from d$m>n, as illustrated in Fig. ^. More precisely, 
dMCn({/ii}j<n) is the image of d$„({yUj}i<n) under the parton shower map in Eq. @. To 
simplify our notation, we will mostly suppress the implicit dependence on the scales /ii<n 
and only write dMC„(/i„). 

By construction, a covering of phase space with no double-counting is given by the map 

E d^n{M.<n) E dUCM , (6) 

n=2 n=2 

where ri^ax will be determined by the maximal available number of external particles in 
the partonic calculation. As long as the scales fin for n < rimax and the parton shower 
satisfy certain mild constraints to guarantee no dead zones, Eq. (^) also provides a complete 
covering of phase space which is one-to-one and onto, i.e. every region of phase space is 
covered exactly once: 

EdMC„M^Ed$„. (7) 

n=2 n=2 

Making the dependence on n^ax explicit, the GenEvA master formula is 

da=J2\Mn{fin)\'dMCM. (8) 

n=2 
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FIG. 7: Phase space with a variable resolution scale. The shaded blobs represent one possible phase 
space projection map as in Eq. (^) based on successive 2 — > 1 recombinations. At the matching 
scale fj,, the event has 7 final state partons. Going from the matching scale fi to the resolution 
scale p,5 maps a 7-body configuration to a 5-body configuration. Further increasing the scale to 
/is and fi2 = Eqm yields a 3-body and eventually a 2-body configuration. The ability to project 
a given point of phase space to a lower-dimensional phase spaces allows for partonic calculations 
. . .)p that depend on multiple scales {/x}. 

A crucial point is that an n-body partonic calculation |A^„(/in)P is defined just as a function 
of but the extrapolation to all of dMC„(/x„) is provided by tlie plienomenological model. 
In principle, all Eq. (||) requires is a phase space generator that can generate scale-dependent 
phase space d$(/i). Such a generator could be built out of existing Monte Carlo tools by 
including a phase space veto. Because d$(/i) implicitly depends on the precise choice of 
phenomenological model, doing so might be nontrivial, but there is no conceptual difficulty. 
Once such a generator exists, we can think entirely in the language of partonic calculations, 
but still generate fully hadronized events with the aid of QCD phenomenological models. 

D. Variable Resolution Scale 

When discussing strategies for calculating the partonic result |A^(/i)p, it will prove very 
useful to define a projection map from n-body phase space to m-body phase space 

$ri $m with m < n. (9) 

This phase space projection map will be the key to creating our NLO/LO/LL merged sample. 
Such a map can be achieved by defining a resolution scale /i, which would resolve more and 
more partons as the scale is lowered. Note that this scale does not have to coincide with the 
scale /i we use to divide up phase space. 

As we raise the scale /2, the projection map will cluster together two partons once their 
"distance" becomes smaller than ft. The phase space map given in Eq. will give $„ $m 
at the resolution scale flm- As an example, consider the event shown in Fig. 0, where at the 
matching scale fi there are 7 resolved partons. At the scale /is, two pairs of particles have 
been clustered together such that the event has 5 resolved partons. Similarly, at the scales 
/is and /i2 = EcM, we have 3 and 2 resolved partons, respectively. 
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There are several such maps available in the literature including the kx algorithm pHl |69 



and the momentum mappings often found in dipole showers |2^, |70[. In fact, any recursive 
cluster-based jet algorithm is an example of such a map, but to be useful, Eq. (||) has to 
respect the symmetry and singularity structure of QCD, which means that the clustering 
procedure should respect flavor, and the scale /i should roughly determine the "soft-collinear 
distance" . Such maps give a sequence of resolution scales, and in the past, it has been mainly 
the ability to obtain such a sequence of scales which has been used in the literature. An 
example is the CKKW procedure [^, which uses a flavor-aware variant of the kx algorithm 
to identify the scales needed to implement leading-logarithmic improvements to tree-level 
calculations. In GenEvA, the actual four- momenta determined by Eq. (P) are also important, 
and we crucially assume that the projection Eq. (|^) returns a set of on-shell four-momenta, 
such that partonic calculations can be deflned in terms of 

)?■ (10) 

In the GenEvA algorithm [0, we will show that we can use the evolution variable of the 
parton shower itself as the resolution variable. Thus, the projection in Eq. will effectively 
be the inverse of the parton shower map in Eq. (|^).^ This reversal property of the projection 
means that we have a consistent way to not just increase but also decrease the matching scale 
at will. This could potentially be very important because it allows one to raise the value 
of '^max in Eq- (iP and thus use additional partonic calculations after the events have been 
generated or even passed through a detector simulation. In fact, a reversable projection can 
be used to define dMC(/i), as any given phase space integration region d$„ can be mapped 
to the proper dMCm (/im) region as needed in Fig. |^. The presence of a reversable phase 
space projection is one of the reasons why the GenEvA algorithm is efficient and versatile, but 
this reversibility property is not strictly needed for the GenEvA framework discussed here. 



E. Strategy for Partonic Calculations 

In the language of Eq. (0), the strategy to systematically improve Monte Carlo is simply 
to eliminate phenomenological models by pushing the scale /i as low as possible (ideally to 
Aqcd) with improved QCD partonic calculations |A^(/i)p. Of course, one would also like to 
refine and tune phenomenological models based on experimental and theoretical input, but 
that is not the focus of the present work, where we are concerned primarily with improving 
the perturbative description of QCD. 

How does one determine a partonic calculation As discussed, the choice can 

be made by physics considerations alone without worrying about algorithmic implications,^ 
and the physics goal is to determine an expression for |A^(yu)p that is formally correct both 
in a perturbative expansion in as and in a kinematic expansion about r, where r denotes the 
ratio of two kinematic scales, such as the invariant mass between two partons compared to 
the center-of-mass energy. The details of how to define partonic calculations are discussed 
in subsequent sections, and we give just a schematic overview here. 



^ Strictly speaking, Eq. will be a pseudo- left-inverse of Eq. (H), meaning that running the shower and 

applying the projection will yield the identity map up to discrete ambiguities. 
^ In practice, the availability of fast numerical methods to evaluate might affect the precise form 

of the partonic calculation, especially in the presence of loop expressions. 
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As is well known, an expansion to fixed order in as does not yield a good description 
of perturbative QCD over all of phase space. The presence of double-logarithmic terms 
{as log^ r)" at each order in the perturbative expansion invalidates the fixed-order expansion 
for small values of r. These double-logarithmic terms can be resumed to all orders in 
perturbation theory by doing a soft-coUinear expansion of QCD. In this paper we will only 
work to leading-logarithmic order, leaving a treatment of subleading logarithms for future 
work. 

The most naive partonic calculation is one that completely avoids the issue of logarithmic 
resummation and only uses tree-level (LO) matrix elements 

\M]fif,)\'^\MTr- (11) 

Note that the infrared divergences in the tree-level diagrams are regulated because phase 
space has a matching scale /i that imposes a phase space restriction. 

Because of the double-logarithmic sensitivity to the scale /i from running the phenomeno- 
logical model, we want to supplement with the correct leading-logarithmic (LL) 

fi dependence. As observed by CKKW [^, this can be accomphshed by multiplying the 
tree- level diagram with a //-dependent Sudakov factor A, 

^ |^ri'A(^CM,/i). (12) 

Because the Sudakov factor can be formally expanded as A = 1 + 0{as), this LO/LL 
partonic cross section is simultaneously correct to leading order in an as expansion and to 
leading- logarithmic order in the soft-coUinear limit. 

The story becomes more complicated at next-to-leading order (NLO) because of the 
need to cancel infrared divergences between ra-body virtual and (n -|- l)-body real-emission 
diagrams. Though in practice, one typically uses Catani-Seymour subtractions 173] to 



calculate NLO observables, for the moment, we will use a slicing method |T^, |75|, |73, 



78| , which is conceptually simpler. Schematically, one writes 

|A^r°(M)l' ^ \MTf + l-Mf'-^f + / \MZ\\' , (13) 

J fi 

where the integral represents slicing part of the (n + l)-body real-emission phase space 
to cancel the infrared divergences in the n-body virtual diagrams. Any divergences in the 
n-body real emission are regulated by the matching scale /i as in the tree-level case, but the 
effect of those divergences can be used again in a slicing scheme to cancel infrared divergences 
in the [n — l)-body virtual diagrams. 

Adding leading-logarithmic information to the NLO calculation is much more involved. 
A naive approach is simply incorrect, 

because the 0{as) pieces in the Sudakov factor change the 0{as) behavior of the fixed- 



order calculation. An achievement of MC@NLO was to figure out an expression for 
|_y^NLO/LL(^^)|2 ^j^g^^ simultaneously has NLO and LL accuracy. In Ref. a specific algo- 
rithmic method to implement \A4^^'~'^^^(fi)\'^ was used that generated events with sometimes 
negative weights, but in the context of GenEvA the same \Ai^^'^^^^{fi)\'^ expression can be 
used to generate NLO/LL accurate events with manifestly positive weights. 
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At this point, GenEvA is simply reproducing known results from PS/ME merging (LO/LL) 
and PS/NLO merging (NLO/LL) in the language of partonic calculations |A^(/i)p. The 
advantage of this language is that it makes it straightforward to merge these two results 
into a partonic calculation that has combined NLO/LO/LL accuracy. As already mentioned 
in Sec. ^ U\ , and as we will explain in more detail in Sec. |3 b\ one can define a scale fim > A* 
at which only m < n partons are resolved. This can be achieved using the phase space 
projection map of Eq. (^. If one has LO/LL information for n partons, and additional 
NLO/LL information for m partons,'' one can supplement the LO/LL partonic cross section 
with this higher-order information, using 

|_^NLO/LO/LL^^)|2 ^ |^^LO/LL^^) |. 'j^^^^^^^ . (15) 

This result describes NLO observables correct to NLO, LO observables correct to LO, all 
with the correct LL behavior. 

Though we do not demonstrate an NNLO/NLO/LO/NLL/LL merged sample in the 
present work, whether or not such a sample can be built in principle only depends on 
whether it is possible to derive an expression for that simultaneously allows N*LO 

observables to be correct to N*LO and N-'LL observables to be correct to N-'LL. Of course, 
since we will interface with a phenomenological model that might not have the correct NLL 
behavior, we now have to push /i as low as possible to make sure that our N^LO/N-'LL 
calculation describes as much of phase space as possible. Using the language of partonic 
calculations, we see that how far Monte Carlo can be improved on the perturbative side is 
in principle a pencil-and-paper question that can be separated from algorithmic issues. 



F. Summary of GenEvA 

Just as "fixed-order calculation" and "parton shower" referred to various concepts, 
GenEvA itself refers to four different concepts, summarized in Fig. ^. GenEvA is: 

1. A Monte Carlo framework based on the idea of generating events according to 

da = \M{fi)\^ dMCifi) (16) 

by distributing a generic partonic calculation with a matching scale, |A^(yu)p, using 
a generic phase space generator with a matching scale, dMC(/x). This framework 
could be implemented using traditional Monte Carlo tools without ever referencing 
the GenEvA phase space algorithm. 

2. A strategy for improving Monte Carlo based on merging different QCD expansions 
to determine the best partonic calculations |A^(/i)p. This strategy will be the subject 
of the remaining sections and yields a formula for an NLO/LO/LL merged calculation 
and should be generalizable to NNLO or NLL. 



^ Here, NLO/LL information for m partons means that one has performed an (m — l)-body one-loop 
calculation. 
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FIG. 8: The four different meanings of "GenEvA". The GenEvA framework is an umbrella for the 
idea of distributing partonic calculations with a matching scale, across phase space with 

a matching scale, dMC(/i). The GenEvA strategy asserts that different QCD expansions can be 
merged into a single partonic quantity |7W(/u)p. The GenEvA algorithm is a method for using a 
parton shower as a phase space generator. The GenEvA program is a concrete implementation of 
the GenEvA framework employing the GenEvA algorithm. 

3. An algorithm for generating phase space with a variable matching scale, dMC(/i), 
by analytically reweighting a parton shower to fixed-multiplicity phase space. This 
algorithm is detailed in Ref. [0]. 

4. A computer program that gives a concrete implementation of the GenEvA framework 
by using the GenEvA algorithm. An alpha version of this software is available from 
the authors upon request, and is used here to show the results of the NLO/LO/LL 
calculation. 

We emphasize that while the GenEvA algorithm for generating phase space is quite novel, 
the important physics behind the GenEvA framework is captured not by the phase space 
integration dMC(/i) but by the GenEvA strategy to define the partonic calculation |A^(/i)|^. 
Though we need a phase space generator with a variable matching scale in order to create an 
NLO/LO/LL merged sample, the physics behind this merging comes from finding a suitable 
form of |A^(/i)p. 

To summarize the complete GenEvA approach to Monte Carlo, it is instructive to separate 
the phenomenological model into a parton shower component and a hadronization compo- 
nent. If we now divide event generation into three regimes as in Fig. |^ — partonic, showering, 
and hadronization — then the GenEvA framework improves our ability to describe the partonic 
regime through improved partonic calculations |A^(yu)p. The showering regime can either be 



described by a traditional showering program such as Pythia |P, or Herwig |TT|, 0, |T5l or 



by the internal parton shower used in the GenEvA algorithm. Regardless of which method 



As we discuss more in the companion paper (l), the reason to use a traditional algorithm is that they 
have already been tuned to data, though there are some technical issues regarding evolution variables that 
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FIG. 9: The three regimes of event generation. Left panel: The GenEvA algorithm could be 
used only in the partonic regime, with the showering regime covered by a traditional, already 
tuned, showering program. Right panel: Alternatively, both partonic and showering regimes can 
be covered by the GenEvA algorithm using its internal parton shower. In this case the matching 
scale between the partonic and showering regimes remains freely adjustable through the reversable 
phase space projection. While these two approaches differ in how Sudakov factors are calculated, 
they both give results that are accurate to leading-logarithmic order. 



is used, there will be no leading- logarithmic /i dependence in the event generation as long 
as |A^(/x)p contains the correct Sudakov factors and the same definition of /i is used in the 
partonic and showering regimes. Finally, the hadronization regime uses some model of the 
strong interactions to hadronize the obtained partons into the hadrons observed in the col- 
lider, and GenEvA inherits the same smooth shower ing/hadronizat ion interface as traditional 
frameworks. 



3. GENEVA IN A TOY EXAMPLE 



To see how the GenEvA framework works in practice, it is instructive to first consider a toy 
example that includes all of the issues involved in constructing partonic calculations |7Vl(/i)p, 
without the technical complications introduced by full QCD. In particular, the toy theory 
will only have a single unambiguous scale. In actual QCD, the scale at which an emission 
occurs will be ambiguous, both because of the ambiguity in the choice of evolution variable 
and the ambiguity of how to pair together daughter particles to form mother particles. We 
will see how to deal with these additional complications in Sec. ^. 

To build the analogy with QCD, we need an example theory with both analytic cross- 
section information as well as a separate phenomenological description. We will use a slightly 
modified version of the toy example introduced in Ref . |Q . After reviewing the toy theory, 
we will describe "Monte Carlo space" dMC(/i) and the kinds of partonic calculations |A^(/i)p 
that can be distributed across Monte Carlo space. 



would have to be overcome to use this option. If the GenEvA algorithm is used in the showering regime, 
then the user gains the freedom to adjust the scale used in the partonic calculations even after detector 
simulation. 
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FIG. 10: Structure of the toy theory. Starting at the scale x = 1, the system can generate 
"radiation" at scales xi, X2, with the Xi values always decreasing. At the scale Xcnt, the 
perturbative radiation matches onto a "hadronization" scheme. 



A. Review of Toy Theory 



Consider a system that can radiate off "photons" as in Fig. |T0|. After emission of a 
photon, the system has energy x left for further emissions with 

< X < Xstart < 1 , (17) 

where Xgtart was the energy of the system before the emission (with Xstart = 1 initially), so 
the energy of the radiated photon is Xstart — x.^ The different phase space integration ranges 
for n emissions are given by 

d$o = 1 ; cl$i = dxi , d$2 = dxi dx2 9{xi > X2) , . . . , 

" (18) 
d$„ = dxi ]^dxj6'(xi_i > Xi) , 

1=2 

and the restriction < Xj < 1 is implied. 

The Born "cross section" corresponds to no emissions at zeroth order in perturbation 
theory. At first order in perturbation theory, there is a contribution to the cross section 
from real and virtual photon emissions. We define 

-Bd{x), — — = a[— + V\d{x), — — = aB , (19) 



da; dx \2e J dx 

where we are doing a perturbative expansion in a, and e is the dimensional regularization 
parameter in d = 1 — 2e dimensions. We need 

lim R{x) = 1 , (20) 



^ Unlike Ref. |Q , we take x to be the energy of the system available for radiation, rather than the energy 
of the emitted photon. Thus, x is continuously decreasing and plays the role of a parton shower evolution 
variable. Note that this means that this model has the somewhat unusual property of having singularities 
if the photons radiate with maximum energy, rather than with minimum energy. 
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such that the integral over the real contribution cancels the infrared divergence in the virtual 
contribution, 

\ R{x) f\ 1 f\ R(x)-1 1 ^ . 

d^^= / / dx^^ = -- + finite, 21 

Jo Jo ^ 2e 

yielding the total cross section to 0{a) 

(Jnlo = B + aV + aB [ dx ^''^^ ~ ^ . (22) 

JO ^ 

Next, we define a function Q{x) which plays the role of the splitting function in QCD 
and reproduces the full radiation in the singular limit x ^ 0. That is, from Eq. (pOD we 
require 

lim Q{x) = 1 , (23) 

x^O 

such that 

lim ^5M^^ = finite. (24) 

The simplest choice would be Q{x) = 1. In general, a nontrivial dependence on x is possible, 
in which case we require Q{x) to be positive between x = and x = 1. The function Q{x) 
can be used to define 

Q{x) 



Aq(xi,X2) = exp 



CXl 

-a I dx 



X2 



(25) 



which plays the role of the Sudakov factor, and which we will often just call A for simplicity. 
Finally, the splitting function together with the Sudakov factor can be used to define a 
parton shower with the differential probability to branch (emit a photon) at x given by 

dP(x) = a ^ A(x,tart, x) dx . (26) 

X 

As before, Xgtart is the energy of the system prior to the branching. We also define a lower 
cutoff Xcnt on the parton shower, which corresponds to the scale at which a QCD parton 
shower would be matched onto a hadronization model. In the following, the parton shower 
defined by Eq. (120) will serve as the phenomenological model in our toy theory. 



B. Phase Space with a Matching Scale 



As discussed in Sec. |2 C| , to avoid any double-counting between the parton shower started 
from a system with n emissions and one with m > n emissions, we want to define phase space 
in the presence of matching scales. Because we are working with a single-scale system, it is 
straightforward to amend the ordinary phase space d$„ in Eq. ([T8| ) with a set of matching 
scales {/ii}i<„. We have 

n 

d<l>„({/ii}i<„) = dxi 9{xi > fio) Y\ dxi 9{xi^i > Xi) 0{xi > fii^i) , (27) 

i=2 

where the restriction < < 1 is again implied. An illustration of Eq. (p^) for n < 2 is 



shown in Fig. 11. Before going on to explain Eq. (B^) in more detail, we want to highlight 



a few main points: 
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FIG. 11: The scale-dependent phase space d$„({/ij}j<„) in the toy theory. The scale fiQ is a fixed 
number, whereas fii = /ii(xi) can be a function of xi. d<I>i(/io) is given by Xi > ^q, and d<I>2(^05 l^i) 
is given by xi > /io and /ii(xi) < X2 < xi. 



1. For each n, the scale /i„ is the scale at which we start the "parton shower" when 
acting on n-emission phase space d$n. In general, fin = f^ni^n) can be a function of 
= {xi}i<n- This can lead to some counter- intuitive but still well-defined situations. 
The example shown in Fig. 11 has a seemingly pathological situation where /ii(xi) < fio 
for certain values of Xi, but this is perfectly consistent and even potentially useful. 
The only restriction on the functional form of for Eq. ( jn\ ) to make sense is 

that /in($n) < Xn- 



2. In accordance with our general discussion in Sec. \i (J|, the restriction Xj > in 



Eq. (|27D cuts out the region of d$„ for any n > i that gets populated by the parton 
shower when acting on d$j_i, thus guaranteeing that there is no phase space double- 
counting. 

3. If we have only calculations for up to ^max emissions, then to avoid dead zones in phase 
space, we have to start the parton shower at Ainmax('^nmax) = ^n^ax when acting on the 
^max-emission contribution, such that d$„ for n > n^ax is completely covered by the 
parton shower. The easiest way to get full phase space coverage with no dead zones 
is to simply use a single fixed matching scale /i, with /i„($„) = /i for n < rimax and 
/^nmax(^nmax) = ^n^^^- Thls also avolds some of the seeming pathologies mentioned 
above. 

To explain Eq. in more detail, we take a closer look at how the toy parton shower 
works. Let MCx-i^^x2,...{^) denote an event for which emissions have occurred at xi, X2, ■ ■ ., 
and the system has energy x left to radiate. An event in its initial state is denoted by 
MC(1), which means that no photons have been emitted, and the total energy available for 
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radiation is x = 1. The event then evolves according to the parton shower Eq. (pB]), giving 



MC(1) ^ A(l, xcut) MC(xeut) + / dxi 

J XcvlX 
1 

da;i 



a 



A(l,Xcut)MC(Xeut) + 
• 1 pXx 



a 



Xi 



-A(l,a;i) 



Xi 



+ 



dxi 



dxo 



a A(l, xij 

Xi 



a Afxi, X2 

X2 



Ma,(xi) 

A(Xi,Xcut) MC^^(Xcut) 



(28) 



The first term denotes the possibihty that no emissions happen down to the "hadronization 
scale" Xcut, which occurs with the no-branching probability A(l,Xcut) and leaves an event 
with energy Xcut left for radiation. The second term denotes the possibility that exactly one 
emission happens above the scale Xcut and so on. The Sudakov factors are defined in such a 
way that the total probability is conserved, 



' dxa^A(l,a;) 

X 



A(l,x, 



(29) 



^cut 



so if we start with some number of events MC(1), the number of events after showering will 
still be the same. 

Now imagine running the parton shower starting at the scale /!„(<!>„) on an event for 
which n emissions have occurred at = {^^i, . . . , In this case 



MCa;i,...,a;„(/in) ^ A(/i„, Xcut) MC:i.,,...,a;„ (Xcut) 



+ 



dx. 



n+l 



^cut 



a 



Q{x. 



n+l, 



Xn+l 



ni X-n+l J 



(30) 



By assumption, the emissions are ordered with Xn+\ < Xn, so for Eq. ( pO|) to be consistent, 
we need to choose /i„($n) < Xn- If in addition we consider an (n + l)-emission event 
n+iifJ-n+i), then to avoid double-counting with the n-emission event, we have to 



MC 



■^1 V •i-^ni^' 



restrict the range of Xn+i to Xn+i > /in- Similarly, considering the effect of all of the 0- 
through n-emission samples, to avoid double-counting regions of phase space when running 
the parton shower, we always need to take Xi > as anticipated in Eq. (p7[). 

Putting together the phase space restriction from Eq. ([27| ) with the parton shower evolu- 
tion of Eq. (p8D, we can now define differential "Monte Carlo space" dMC„(/i„) as the result 
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FIG. 12: Definition of "Monte Carlo space" dMCniHn) in the toy theory, which is obtained by 



collecting the various parts of in Fig. 11 



of running the parton shower on n-emission phase space d$({;Uj}j<„): 

dMCn({/Xi}i<n) = d$n({/^i}i<n) | A(/i„, Xcut) 



a 



A(/i„,a;„+i) 



n+l 



+ dx„+i da;„+2 



X. 



X 



a ■ 



Xn+2 



+ ■ 



(31) 



where the restriction < Xj < 1 is again implied. Note that the Monte-Carlo space 
dMC„({/ii}j<„) has variable multiplicity. An illustration of Eq. (|3ll) for n < 2 is shown 
in Fig. 0. 

With the appropriate choice of as discussed above, 

dMC„(/i„) ^ ^^n{Xcnt) (32) 
n=0 n=0 

gives a covering of phase space with no dead zones and no double-counting. Here, d$„(xcut) 
represents running the hadronization model on the partons at the scale Xcut- The arrow 
indicates that dMCn(/i„) includes additional weight information from the splitting functions 
and Sudakov factors. 



C. The Master Formula 

A compact master formula for the fully differential cross section coming from our as-of-yet 
undefined partonic calculation interfaced with the parton shower is given in analogy with 
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Eq. (D: 

^max 

da=J2 \Mn{fin)\' dMCnifin) • (33) 
n=0 

As already mentioned, |A4n(/in)P is a function of More generally, |A^n(/^n)P can also 
depend on the scales {/ii}i<n that are implicit in dMCn(/in)- 

It is at this point that the different concepts behind the parton shower come into play. 
As an instructive exercise, imagine we only know the Born cross section B and the splitting 
function Q{x). In that case, the best approximation to the differential cross section we can 
write down is 

da = |A^oMI'dMCo(/io) with /io = 1 , = = B . (34) 

However, there is an entirely equivalent description of the same physics in terms of the 
differential cross section 

da= |A<o(/io)|'dMCo(/io) + |A<i(/ii)|'dMCi(/ii), (35) 

where we now have to choose /xi = xi to avoid dead zones, and the partonic calculations are 
defined as 

Parton Shower (LL): 

|A^o(/io)|' = 5A(l,/io), 

\Miixi)\^ = aB^^A{l,x,). (36) 

Xi 

In Eq. (0), we would say that the boundary between the partonic calculation and the parton 
shower is /i = 1, whereas in Eq. ([36|), we would say that the boundary is some scale fi = /io- 
This is an example of the ambiguity of the parton shower having a meaning above or below 
the scale /x, as discussed in Fig. 0. This flexibility in changing the value of /i is a feature and 
not a bug, and it comes from the fact that the parton shower is both a phenomenological 
model based on QCD and a well-defined expansion of QCD. The only difference between the 
two descriptions are the words and perhaps algorithms we use above and below fi. But the 
freedom to incorporate a Sudakov factor A into a partonic calculation |yW(/i)p as needed 
is what makes it possible to define partonic calculations that include leading-logarithmic 
resummation. 



D. Partonic Matrix Elements for the First Emission 

With the toy master formula Eq. ( p3D in hand, we simply need to define the best partonic 
calculations |A1(yu)p we can with the available information from the toy theory. Apart from 
issues of Monte Carlo efficiency, this definition can be dictated by physics considerations 
alone, as the GenEvA framework can distribute any |A^(/i)p across dMC(yu). We will start in 
this subsection by only considering the first emission in our toy theory, writing for simplicity 
X = Xi- The generalization to multiple emission will be given in the next subsection. Keeping 
in mind the freedom offered by Eq. (|35|), we can define everything in terms of 

\Mo{fio)\^ and \Mi{fii = x)\^ . (37) 
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I-Mo(mo)P 


x\Mi{x) 


\' 




Parton Shower (LL) 




aBQ{x) 


Aq 


{l,x) 


Tree-Level (LO) 


B 


aBR{x) 






Sudakov-Improved (LO/LL) 


BAq{1,ho) 


aBR{x) 


Aq 


(l,x) 


NLO Slicing (NLO) 


B{fio) 


aBR{x) 






NLO Subtraction (NLO/LL) 


S(/Xo)Aq(1,/Xo) 


aB {R{x 


)- 


Q(x)) + aB{^io)Q{x) Aq{1,x) 


NLO Elegant (NLO/LL) 


0"NLO At(1,/^o) 


aBR{x) 


At 





TABLE L Summary of the various possible one-emission partonic calculations in the toy theory. 
At the one-emission level, all of the theory information can be summarized by a zero-emission 
calculation ITWqCa'o)!^ with a matching scale fiQ and a one-emission calculation with a 

matching scale = x. The definitions of the samples and symbols are given in the text. 



A summary of the partonic calculations in this subsection is given in Table |. 

The most naive use of the analytic cross section information from Eq. ([191) is to choose 

Tree Level (LO): 

\Moifio)\' = B , 



\Mi(x) 



aB 



R{x) 



X 



(38) 



The total and one-emission cross sections resulting from Eq. (|38|) are 



10 



a = B + aB Cdx ^^^^ 



X 



da _ ^ 1 r R{x) 

dx X \ (5(x) A(/io, x) 



for X > fiQ 
for X < fiQ 



(39) 



which is what one would obtain if events from a tree-level event generator were passed to a 
parton shower program. 

The main problem with the choice in Eq. ( pH]) is that it does not sum the leading- 
logarithmic contributions, which although technically of higher order in perturbation the- 
ory, can be numerically very important. This leads to the logarithmic dependence on the 
unphysical matching scale /iq in both a and da/dx in Eq. (|39|). The reason is of course that 
since the parton shower does include this resummation, it has logarithmic dependence on 
the starting scale /iq. Thus, if the tree-level results are combined with the parton shower 
using Eq. (Bl), we are left with this logarithmic dependence on /iq in Eq. (B^). 



In what follows, da/dx always refers to the inclusive differential cross section obtained by integrating over 
all values of Xi with i > 2. 

Actually, Eq. ( p8| ) is slightly better than what one would get by blindly running a tree-level event generator 
through a parton shower program, because implicit in the definition of |A^o(a*o)P is that the parton shower 
will run from fiQ to avoid double-counting. Out of the box, an ordinary parton shower program would 
start the parton shower at a; = 1 for the 0-emission piece, which would lead to double-counting. 
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To improve the situation, we can resum the leading logarithms in the partonic calculation 
by supplementing the naive tree-level result with Sudakov factors: 

Sudakov-Improved (LO/LL): 



5A(l,^o) 



(40) 



At leading order in a, this result is identical to the tree-level result Eq. (|38|). The resulting 
total and one-emission cross sections are now given by 



a = B + aB / dx — ^^A(l,x) 

da _ ^1 (Rix)A{l,x) 

dx x\ Q{x) A(l, jjo) A(/io, x) 



for X > fiQ 
for X < fiQ 



(41) 



The factor of A(l,/io) in da/dx for x < Hq comes from |A^o(/^o)P and cancels the Hq 
dependence in A(/io,x), which is generated by the parton shower. Furthermore, since we 
know from Eq. (|2^) that {R{x) — Q{x))/x is finite as x ^ 0, the logarithmic dependence on 
fiQ in the total cross section cancels, too. 

Since Eq. (^l]) does not exhibit leading-logarithmic dependence on the matching scale /xq, 
Eq. (HOf) provides a well-behaved merging between matrix elements and the parton shower. 



This result is consistent with the observation made in Ref. ||2^ that matrix element results 
need to be supplemented with Sudakov factors to allow for a merging with parton shower 
algorithms. What is important, however, is that we did not need a special algorithm to 
implement Eq. (^OD, because this is just a choice of what partonic matrix elements to use in 
the master formula Eq. (p^). Including the correct Sudakov factors is required by physics 
considerations alone and, for example, has nothing to do with the fact that the GenEvA 
algorithm happens to have an efficient mechanism to implement this LO/LL merged sample. 

The total and one-emission cross sections in Eqs. (^) and ( |4l| ) are correct to C(l) and 
0{a), respectively, which is of course because we only used tree-level information so far. 
To also get the total cross section correct at 0{a) (NLO) we need to improve the partonic 
calculation by including the virtual corrections to the zero-emission cross section, given by 
dav/dx in Eq. (^). There are numerous strategies one could consider. If we do not wish to 
reproduce the leading-logarithmic resummation, and only want to get observables accurate 
to NLO, then we can implement a naive slicing scheme ||73|, 



1 75 



77, 78 



NLO Slicing (NLO): 



|^o(/io)|' = 5(/io) 



\MAx)\^ = aB 



R{x) 



X 



where 



B{^q) = B + aV + aB lim 



B + aV + aB dx 
'o 



MO 



2e ^ JO 
R{x 



^\ Rix) 
dx 



l+2e 



X 



X 



(42) 



aB / dx 



Mo 



X 



(43) 
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The cross sections following from Eq. (|4^) are 



f R(x) f R(x) — 1 
cr = B(/io) + aB dx —-^ = B + aV + aB / dx = ctnlo , 

Jfio Jo ^ 

da 1 ( B R{x) for x > /io 

— = a - { ~ (44) 
dx X 1^ i?(/io) (5(x) A(/io,x) for x < /io • 

The total cross section now equals the correct NLO cross section unlo from Eq. (^) and 
has no Hq dependence. Furthermore, all observables are accurate to NLO in the fiQ 
limit. However, as in the tree- level result Eq. ([38|), the leading logarithmic dependence is 
not resummed, meaning that the one-emission cross section da/dx does not have the correct 
Sudakov suppression for small x and has large /Zq dependence from both -B(/io) and A(/io, x). 

To include the resummation, Sudakov factors must be included in the partonic calculation 
in such a way that all observables are still accurate to NLO when they are expanded in a. 
This can be achieved using a subtraction method similar to the one proposed by Frixione 



and Webber in MC@NLO ||4^. For the purpose of this discussion, we do not need to know 
the actual algorithm used in that paper to generate their results, only the corresponding 
partonic calculation itself: 

NLO Subtraction (NLO/LL): 

|A<o(/io)|' = 5(^o) AQ(l,/io), 
\M^{x, /io)P = aB ^^""^ ~ ^^""^ + aB{^^,) ^ Aq(1, x) , (45) 

where we made the dependence of A on Q{x) explicit, and we defined 
S(/io) = 5(/io) + aB j dx ^^^^ 



^0 Rix) - Q(x) 1 - Q(x) , , 

= B + aV + aB dx — =^ - aB dx =^ . (46) 

JO ^ Jo ^ 

Note that we have used the fact that the one-emission piece can in general depend on /io- 
The cross sections predicted by the substraction method in Eq. (^5|) are 

N o R{x)-Q{x) 
a = B{fio) + aB dx = ctnlo , 

da 1 ( B R{x) - Q{x)[B - B{i2o)Aq{1,x)] for x > /io 

— = a — < - (47) 

dx X yB{i2o)Q{x) AQ{l,fio)AQ{iJo,x) for x < /io • 

By expanding the above expressions to 0{a), one can easily show that the predictions 
of the subtraction method are equal at NLO to the predictions of the slicing method in 
Eq. (^41). Furthermore, in addition to producing the correct NLO total cross section, the one- 
emission cross section in the subtraction method has no leading-logarithmic /io dependence 
and exhibits the correct Sudakov suppression for small x. 

In Ref. [0 , the form of Eq. (|^) was selected mainly for its algorithmic simplicity and the 
lack of explicit e dependence. The important difference is that in Ref. 143] each of the two 
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terms in |A^i(x, /io)P is generated by a separate event sample. This can be inefficient and 
lead to events with negative weights, since in general, the relative size of the two terms can 
be very different, and the ffist term can also become negative. In contrast, in our approach, 
the form of \Aii{x, /xo)P in Eq. P5| ) is just another choice of a partonic calculation, so it can 
be used in the master formula Eq. (^) to generate a single, positive-weight event sample. 

Although we could in principle use Eq. ( ^Sj ) in our approach, it still has some drawbacks. 
First, due to the rather complicated structure of Eqs. and (^B]), they are tedious to 
generalize to more emissions or higher orders in a. Second, since |A^i(x,/io)P depends on 
the subtraction function Q{x), it is in general not positive definite, and for pathological 
choices of Q{x) can lead to negative cross sections. 

However, there is no deep conceptual reason why subtractions must be used to generate 
Monte Carlo. Indeed, there are many different ways to get results that are accurate at NLO 
after expanding in a, that have no leading-logarithmic /xq dependence, and that have the 
correct Sudakov suppression in the small x limit. One particularly elegant method proposed 
by Nason in Ref. |^ in a slightly different context is 

NLO Elegant (NLO/LL): 

\Mo{fio)\^ = (Tnlo AT(l,/io) , 

I p = a aNLo ^ At(1, x) , (48) 

where ctnlo is the total NLO cross section from Eq. (12^) , T{x) is an effective "splitting 
function" 

T{x) = Rix) , (49) 

cnlo 

and At is the Sudakov factor obtained from T{x) in analogy to Eq. (pSj). The cross sections 
are now 



C = 0"NLO 



Ar(l,Aio)+ / dxa^^AT(l,x) 



O'NLO 



^-aa ]^ fT{x) At{1,x) for x > fxo 

dx X \Q{x) Axil, fio) Aq{ij,o,x) for x < /io • 

As in the subtraction method, the total cross section is identical to the NLO result, and the 
one-emission cross section has the correct Sudakov suppression with no leading logarithmic 
/io dependence. Note that the latter again relies on the fact that R{x) and Q{x) have the 
same singularities for x — > 0, so and Aq resum the same leading logarithms. 

The form of Eq. (^S)) is identical to the merged LO/LL example of Eq. ( ^OD by replacing 
Q{x) —>■ T{x) and B (Tnlo- The nice feature of Eq. (^S[) is that it is not only conceptually 
simple, but it also uses only functions that are completely well defined in an NLO calculation 
without ever needing to introduce an ad-hoc subtraction function. It is also clear from the 
above expressions that the cross sections are always guaranteed to be positive. In Sec. we 
show how to generalize this method to the realistic case of QCD. On the other hand, there 
is also no reason why subtractions could not be used in GenEvA if a subtraction method is 
more convenient for theoretical or numerical reasons. 



^"^ Of course, for any reasonable choice of Q{x), Eq. ( p5| ) will lead to a positive weight NLO/LL event sample. 
Pathologies occur when, say, R{x) — Q{x) ~ 0{a^^). 
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E. Multiple Emissions 



The results from the previous subsection can be extended straightforwardly to multiple 
emissions in the partonic regime. At tree level, the partonic cross sections are given by 

Tree Level (LO): 

where the symbol f!^'^'^ represents an ordinary tree-level calculation in the toy theory with 
the a dependence and Xi singularities made manifest. For example, Jq'^'^ = B and f\^'^'^{xi) = 
BR{xi). We assume that in the singular regions of phase space 

lim ^"^1' ■ ■ ■ ' "^""1' /n-i (-^i) • • • 5 •^"-i)Q('^n) finite (^2) 

the analog of which is indeed true in the collinear limit of QCD as long as Q{x) is generalized 
to include flavor information. 

As discussed before, the tree-level results do not resum the leading-logarithmic behavior, 
which can be fixed by including Sudakov factors appropriately: 

Sudakov-Improved (LO/LL): 

l-Mn^/'"^ (/in) r = ""^^"'^(^i';^^'---'^") Aq{1, Xi) Aq(xi, X,) ■ ■ ■ Aq(x„, . (53) 



This is precisely the result that was advocated in Ref. [^] in order to merge matrix elements 
and parton showers. To see that Eq. (|53D is well behaved, note that the difference across 
the boundary Xn = fin-i is^^ 



e^oydXi---dXn dXi---dx„ J 



Xn=fln-l+£ 

nfn i^li ■ ■ ■ 1 ^n-li ^n) fn-li-'^li • • • i •^n~l)Qi^n) « 

= a Aq(1,x„ 

which has no large logarithmic dependence on assuming Eq. (|5^) holds. 

As the example most relevant for creating our NLO/LO/LL merged sample, we finally 
consider the combination of the NLO results with higher-multiplicity tree-level matrix el- 
ements. This will give a partonic calculation that reproduces the exact NLO results, but 
also uses the full tree-level quantum interference for multiple emissions. Using the previous 
definitions of unlo and T(x), we find: 



— fJ'n — l £ 

n+1 



+ 0(a"+i), (54) 

^n—f^n— 1 



Ref. also advocated to evaluate the couplings a at successive intermediate Xi scales in the problem, 
which in QCD corresponds to including extra subleading-logarithmic information. 



Eq. (54) is again the inclusive differential cross section, where we have integrated over all values of Xi with 
i > n. 
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Best Combination (NLO/LO/LL): 



Xi 



|_^NLO/LO/LL^^^) |2 ^ e.^^ (x^ , x^, ■ ■ ■ , x„) ^^^^^ Aq(xi, X,) ■ • ■ Aq(x„, . (55) 

For n = 1 and /ii = Xi this reduces to Eq. (^). 

The factor of At(1,xi) appears in |A^n(/Un)P to ensure that the total cross section has 
no large logarithmic dependence on fii.^^ In fact, this is the only difference between the 
LO/LL and NLO/LO/LL expressions for |A^ra(/in)p5 but it is crucial to ensure that all NLO 
observables (including the total cross section) are accurate to NLO with no large logarithms. 
In particular, it is straightforward to integrate Eq. (B^D to show that 



o" = (TNLo + a / dxi / dx2 At(1,xi) Aq(xi,X2) + (^(a ) 

(56) 

which has no large logarithmic dependence in /xq or /^i as long as Eq. (|52D holds. 



F. The Importance of Phase Space Projection 

In the above discussion, we never needed to talk about the phase space projection feature 
from Eq. (H) to write down our partonic calculations |A^(/i)p. The reason is that the toy 
theory is a single-scale theory, so phase space projection acts trivially. Given a phase space 
point {xi, X2, . . . , x„}, we can simply define the intermediate resolution scales as Jin-i = 
Xn-i) = and SO ou, such that the projected phase space points are 

{Xi,X2, . . . ,X„} ^ {Xi,X2, . . . ,X„_i} ^ {Xi,X2, ...,X„_2} (57) 

In full QCD, there are many more scale choices one could make for /i, and the difference 
between two different choices will yield logarithms (hopefully not large) of fiA/fiB- More 
importantly, in QCD there is an important requirement of final states being on shell, e.g. if 
we project two gluons into one mother gluon, we have to somehow shuffle momenta to put 
that mother gluon on shell. 

So why use phase space projection in QCD if it is so hard? In the language of Eq. (p3[), 
the reason is that we need some way of getting the analog of the factors of At to appear in 
|-^n(/^n)P- In some sense, A^ is a property of the NLO calculation that we want the LO 
tree- level matrix elements to inherit, but the tree- level matrix elements do not have enough 
information to properly calculate A^ themselves. But if we take an n-emission matrix 
element and project it to a one-emission matrix element, then we can use our existing NLO 



We cannot replace every Aq with At because in QCD there is additional flavor structure, which would 
make this choice ill-defined. 



30 



^CM 1 T 



Theory C 



Mbc 



Theory B 



Theory A 



A 



IR 



A 



IR 



B 



l>("(MBc)r 
I-M^(/^ab)P 



FIG. 13: Using nested descriptions to define the best partonic calculations. Given a theory A valid 
down to the scale n, the partonic calculation can be improved using information from a 

presumably better theory B which is however only valid down to a scale /xab > /-i-- The "matching 
coefficient" |A^^(/iAB)P/|A4^(/XAB)P allows the full event sample to capture all the physics of 
theory B, while still retaining information about theory A below the scale /xab- The same logic 
can now be repeated with a third theory C which is only valid down to a scale hbc > /"AB- 



machinery to NLO-improve the LO calculations. In formulas, we can write the n-emission 
cross section in Eq. (|55|) as 



I . .NLO/LL/_ _ N |2 
|_^NLO/LO/LL(^^)|. ^ |^^LO/LL(^^)|2 ^ " . (58) 

\M-^ [ill = xi)\^ 

The ratio on the right-hand side equals precisely At(1, a;i)/AQ(l, xi) and can be thought 
of as the "matching coefficient" between the NLO/LL calculation and the other Sudakov 
improved tree-level calculations. 

In fact, Eq. ( |5BD is just a specific case of a more general version of matching different 
partonic descriptions and running between them. Imagine, we have a set of nested partonic 
descriptions A, B, C, and so on as in Fig. [l^, then the best partonic calculation we can 
construct with the available information is 

Best Combination (A/B/C/- • • ) = 

where the various fi scales encode the multiplicities and energy scales at which the matching 
occurs. 

Though obscured in the dMC(/i) notation, but clear from Eq. (PBD, the NLO/LO/LL 
merged sample is equivalent to taking 

A = LL, B = LO/LL, C = NLO/LL, (60) 



where the LL description is just the parton shower written in the spirit of Eq. (|36D. The 
factors of |A^^^(yLi)p do not appear in Eq. ( pH]) because we use the parton shower not 



31 



as partonic calculation but as phenomenological model. Though we will not pursue this 
direction in the present work, we expect that creating an N*LO/N-^LL merged sample will 
just require finding appropriate descriptions A, B, C, and so on. 

As a side note, the reversable phase space projection in the GenEvA algorithm is 
equivalent to the statement that we have an analytic way of calculating |A1^^(/i)p for the 
parton shower at all scales /i. One of the reasons that the GenEvA algorithm is efficient 
comes from the realization that the Sudakov factors in 

cancel, reducing the number of calculations necessary to merge an LO/LL calculation with 
the parton shower. 

The next two sections deal with the technical complications introduced by full QCD. 
However, the main ideas are captured by the toy example just presented, so the reader not 
interested in the technical details can safely skip directly to Sec. ^ for the results of the 
NLO/LO/LL merged calculation in QCD. 



4. THE FIRST EMISSION IN QCD 

Having investigated the toy model in detail, we now move on to the realistic case of QCD. 
In this section, we will study how to implement the QCD cross section for e+e~ —>■ n jets up 
to 0{as), thus including the processes e~^e~ qq and e+e~ qqg. The extension to final 
states with more than three partons will be discussed in the next section, though most of the 
important physics considerations necessary to construct the NLO/LO/LL merged sample 
appear here. 

The essential complication for e"'"e~ n jets compared to the toy example is the pres- 
ence of multiple well-defined energy scales. This same complication appears in the process 
pp or pp ^ n jets, so we anticipate that the same solutions present in the leptonic case 
should have a generalization to the hadronic case. At the end of the day, these complica- 
tions are resolved by a thorough understanding of phase space, so we will begin with 3-body 
phase space before moving on to the relevant QCD calculations and the definition of the 
partonic matrix elements. 



A. Three-Body Phase Space 

At the level of phase space, there are two complications in QCD compared with the toy 
model studied in the previous section. First, a single emission in QCD is specified by three 
independent variables, as opposed to the single variable in the toy model. Second, on top of 
these three variables specifying the hadronic kinematics, there are two additional variables 
describing the orientation of the hadronic system relative to the e+e~ beamline. 

There are many equivalent ways of choosing these five variables. For our discussion, it 
will be convenient to use the thrust axis to define the orientation of the hadronic system 
relative to the beamline, and we use fi^ to describe this orientation. The remaining three 
variables describe the kinematics of the hadronic system relative to the thrust axis, which 
we choose to be the invariant mass ta between the quark and the gluon, the invariant mass 
tb between the antiquark and the gluon, as well as the azimuthal angle of the hadronic 



32 



system relative to the thrust axis. In other words, we decompose Lorentz-invariant 3-body 
phase space as 

d$3 = E^cM dfi^ dta dU d0 , (62) 

where = iPe+ + Pe-f and 

, {Pq+Pg? . _ {Pg+Pg? .Ro^ 

^o- - ' '^b - ^ • (OdJ 

-^CM -^CM 

For the remainder of the discussion, we will mainly ignore the Q2 and (f) dependence. 

The key to constructing the analog of the master formula in Eq. (^3|) is to have an 
unambiguous definition of the scale fi that will separate our QCD calculations from the 
phenomenological parton shower. For simplicity, we will assume that the parton shower 
uses virtuality 

as the evolution variable. We discuss the issues with extending our results to more modern 
Px-ordered showers in the companion paper To keep the notion of /i as a mass scale, we 
also define 

= # , (65) 

and use dimensionless variables in the remainder of this discussion. We anticipate running 
the GenEvA framework from 1 = Eq^^/ E^y^ until the scale /i^, attaching a parton shower 
starting from (i^ going down to tcut — ^irZ-^cm' ^^'^ finally interfacing with a hadronization 
model at tcut- 



The total 3-parton phase space < < 1 — tf, < 1 is shown in Fig. The 3-jet 
region 3 is defined as the region where both ta and are larger than (i^. The 2-jet region 2 
corresponds to both ta and tfe being less than fi^. Finally, there is an ambiguous 2^ region 
where ta is greater than fi^ and th is less than /i^, and a similar 2b region where the roles of 
ta and th are reversed. Formally, these phase space integration regions are defined as 

dta dtfc = dta dtb , / dta dtb = dt„ / dt, 



b , 

2 Jo Jo 

dta dtb = dtb dta , / dta dtb = dta dtb . (66) 

Jo Jfi'2 J2b Jo J(i2 

By definition, the regions 2, 2a, 2b are covered by running the phenomenological parton 
shower on 2-body phase space starting at ft^, with the 2^ and 2b regions arising from small 
virtuality, small angle gluon radiation. Hence, they belong to dMC2(/i^). The remaining 
part of d$3 is 3 = d^^i^fi'^), and running the shower on 3 defines dMCs^fil). Since rimax = 3, 
we need to pick an appropriate scale /i| to avoid dead zones, similar to our discussion in the 
toy model in Sec. P B| . What we need is the QCD-version of the condition /inmax = ^"max 
found in the toy model. Hence, we have to decide whether to take /i| equal to ta or tb, and 
in fact, either choice would leave no dead zones. 

For the moment, imagine using the parton shower to cover all of phase space. Given the 
hard scattering process e"'"e~ — > qq, we can run the parton shower from tgtart = 1- The first 
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FIG. 14: The available phase space < ta < 1 — th < 1 (ot e+e~ — > qqg, where are defined in 
Eq. (|6^). The separation between the 2-jet region and the 3-jet region is defined by a matching 
scale fi'^. Only region 3 has both ta and tb above the matching scale /i^ and therefore corresponds 
to the 3-jet region of phase space. The ambiguous 2 regions could be generated by a parton shower 
cut off at /i^ by "accidentally" populating a region closer to the QCD collinear singularity than /i^. 
However, that part is also generated by emissions below fP and is therefore covered by dMC2(/i'^) 
and must be vetoed in dMC3(/l3). 



emission can either come from a gluon being radiated from the quark at temit = ta or from 
the antiquark at temit = h- Regardless at what scale temit the emission occurs, the parton 
shower continues to run from that scale down, filling out the rest of phase space. Thus, 
knowing ta and tb alone does not permit a definition of the proper scale /i^ = temit, and a 
wrong choice could lead to potentially large logarithms log^(ta/tb). The challenge is that 
a point $3 gives us only access to ta and tb, and we have to decide how to properly account 
for the logarithms of the ratio of these two scales. 

We will discuss several ways to resolve this ambiguity in the definition of /i^, and for now 
we only provide a framework that allows the /t^ = ta vs. tb problem to be solved in principle. 
In particular, for now we will simply have two samples of 3-parton events, 3-parton events 
where we start the shower at ta and 3-parton events where we start the shower at tb, and let 
the user decide what partonic calculations to use for each sample. Thus, we will have three 
GenEvA event samples 

dMC^ifi') , dUC^ifil = ta) , dUCsifil = tb) , (67) 

which give a complete covering of phase space. To not introduce double- counting in the 
cross section, we now have to be careful to remember that there are two dMCs samples. 
We stress that this potential double-counting is conceptually very different from the double- 
counting between different phase space algorithms discussed earlier. Here, it arises from a 
scale ambiguity in QCD which has to be resolved by calculational means. 

The 2 regions are yet another example of a scale ambiguity and how the notion of "above 
H" and "below /i" can be so confusing. Since the dMC2 sample covers the 2a region, it has 
to be vetoed in dMCa, despite the fact that it could have been generated by the phenomeno- 
logical model with a large angle emission above /i^. As seen in Fig. |14|, a gluon emission 
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could have occurred at a large angle at a large scale, while for the purposes of phase space 
this is part of dMC2(/i^), and so the actual scale of emission should be considered below fi^. 



B. The Master Formula 

With the event samples of Eq. (0), we can write the differential cross section in terms 
of the master formula 

da = dMC2(A^) + dMC3(t.) + dMC3(t,) , (68) 

QSP2 'isP3 'isP3 

where we are using the notation dcrj/d$ instead of \J^i\^ because we anticipate integrating 
over some of the phase space variables to simplify expressions. The argument in the dcxj 
functions reminds us of the scale at which we start running the parton shower for those 
samples. To avoid double-counting regions of phase space, the cross sections dcr3a and da^h 
must satisfy 

dos ^ da^aitg) der3fc(tb) 

d$3 ~ d<I)3 d$3 ' ^ ' 

where d(j3 is the full 3-parton cross section. If we are not worried about large log^(ta/t6) 
logarithms, then a simple definition of the dcr3j partonic cross sections is 

d$3 d$3 d$3 d<l>3 

where aa($3) + af,($3) = 1. In the absence of leading-logarithmic information, any choice 
of Ua and a;, is equally valid, however in the presence of leading-logarithmic information the 
aj's have to be chosen to correctly treat logarithms of ta/h. 

With Eq. ( |55D in hand, we can now try to find the best definitions for the partonic cross 
sections do"j, but first we have to review the relevant QCD calculations. 



C. Relevant QCD Calculations 

The process e"*'e^ qqg is singular as U 0. To understand this singularity structure, 
consider the differential cross section in c? = 4 — 2e dimensions, and integrated over the three 
angular variables Vt^ and 0. One finds the well-known result 



da 



B 



h 



dtadtb 

day CisCp ( 4vr/i^ 



CM/ \ t ) 

2 



dtadtb 27r V 

don asCp (AniilX 1 - 2e ^ R{ta,tb) - 2e{ta + tbf 



dtadtb 27r V^cm/ r(2-2e) (1 _ 4 _ t,)2.ti+2.^i+2e ' 

where /x^ is the standard renormalization scale of dimensional regularization, and has nothing 
to do with the scale /z that separates QCD calculations from the parton shower. Here, B 
denotes the Born cross section 

S = iV,g-^, (72) 



CM 
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with Nc = 3, Qq the charge of the quark, and Oem the fine-structure constant. We have 
also defined 



V 



6 



(73) 



The virtual diagram only contributes at ta = = 0, but is infrared divergent in 4 
dimensions. However, the real emission also diverges as tj ^0. Integrating these expressions 
over the allowed phase space < < 1 — tj, < 1 we find for the various contributions to the 
total cross section 



aB = B , 

agCp ( 47r/i2^ ^ 



ay 



Or 



27r 
27r 



B 



rp2 



^,2 3 1 9 771^ 



(74) 



such that the infrared 1/e divergences cancel and the total cross section gives the well-known 
result 



O^NLO = (Jb + Ov + <JR = B 



1 + V+ 

27r V 2 6 



B 1 



3 OisCp 
2 2ti 



(75) 



in the e — > limit. 

The singularity structure of the real emission in QCD is reproduced by the well-known 
Altarelli-Parisi |B6| splitting function 



hi 1 Zi 



where Zi are defined as the energy fractions 



Eq + Eq 1 + ta 



E, 



Zb 



Eq + Eq 1 + tf, 



(76) 



(77) 



Rewriting the splitting functions in terms of the variables ta and t^, taking into account the 
relevant Jacobian factors, we can define the analog of the Q functions in the toy example 



Qa(ta, h) 
tah 



ta(ta + h) 



1 + 



1+ta 



Qbita, tb) 
tatb 



tbita + h) 



l-tg 

i + h 



, (78) 



where Qa is the splitting function for q —>■ qg and Qb for q qg. One can easily verify that 
the sum of the two splitting functions reproduces all the singular behavior of the function 

R{ta, tb) 

R{ta-,tb) - Qa{ta-,tb) - Qb{ta-,U) n , n r^n\ 

= finite as . (79) 



For simplicity, we are only including e+e — > 7* — !■ partons. An intermediate Z-boson can easily be 
included by the appropriate change in the Born cross section B. 
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For later convenience, we also define a Sudakov factor 



AQ(ti,t2) = exp 



Itx 



dta 



(80) 



H2 JO tatb 

which is symmetric in a <-> 6. The tb integration range goes down to zero because this 
corresponds to the proper integration range in the original 2;a-dependent splitting function. 

For technical reasons relating to the conservation of four-momentum, the parton shower 
in the companion paper n actually uses the splitting function 



Qaita, tb) 
tatb 



tb 



VT^a 1 



1 



2tb + ta-l 



(81) 



and Qbita,tb) = Qaitb,ta)- These Q functions are perfectly consistent with the condition of 
Eq. ([TSp as longer as the simultaneous limit t^, t;, — > is avoided. This double-soft divergence 
leads to extra subleading logarithms in the Sudakov factor. Because we only ever work to 
leading-logarithmic order, this is not a problem, though the specific distributions obtained 
using Q vs. Q will of course differ. Eventually, one would want to tune the Q functions to 
minimize this subleading effect. In addition, one can use the difference as an estimate of 
subleading-logarithmic effects. 



D. Partonic Cross Sections at LO 

In analogy with Sec. |3 D| , we can define partonic cross sections for the master formula 
Eq. ([68| ) using the above QCD calculations. For simplicity, we will largely ignore the depen- 
dence on Vt^ and (j) and focus only on ta and tb- 

If we only had access to a parton shower, then it might distribute events according to: 

Parton Shower (LL): 



asCp 
27r 



.J2a tatb J2u tatb 



tb) 



dci'iaita) asCp a Qaita, tb) . 2 /i . 

— n An i,t 



dtadtb 271 tatb 

dosM ^<^B 9l!^^ AUl, tb) . (82) 

dtadtb 2tI tatb ' ' ^ ' 

The details of the parton shower algorithm that yield this kind of distribution are given in 
the companion paper |T]], though we note that this shower treats the gluon emission from the 
quark and antiquark symmetrically. The fact that the Sudakov factors are always squared 
comes from the fact that in a symmetric shower, the no-branching probability for the quark 
and the antiquark are tied together. 

For the present purposes, the most relevant information about Eq. (^) is that these 
functions include the correct leading-logarithmic resummation through the Sudakov factors 
while reproducing the Born cross section: 

a,i^^) + / dt. dtb + / dt. dtb = B . (83) 

Js dtadtb Js dtadtb 
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The integrations over the 2 regions in a2 are necessary to get the total cross section correct, 
and correspond to the fact that in the shower, a gluon can spht off at such a large angle from 
the quark that it becomes singular with the antiquark and vice verse, effectively yielding 
2-jet events, as shown in Fig. |n. 

If we now have the tree-level real-emission calculation and ignore leading logarithms, then 
the most naive partonic cross section is 

Tree Level (LO): 



das 
dtadtb 



5, 

OisCp o Rita-, tb) 



27r 



B 



tr, 



(84) 



At this point, we do not have enough information to define separate dcxsa and dcr^j, cross 
sections and will rely on some unspecified a, function to determine the relative weight of 
the two different dMCs samples, as in Eq. (|5^). 

While Eq. ( p^ gives the correct leading order behavior for 3-jet differential distributions, 
the total cross section has a strong dependence on the unphysical matching scale, which can 
be seen by expanding the resulting expression for the total cross section around small values 
of /t^. This gives 



a = B 



1 + I 4 log /i + 61og/i+ - - — 



+ 0(/i^ 



(85) 



This double-logarithmic dependence on the matching scale is of course due to having used a 
fixed-order calculation, which by construction does not sum any of the leading-logarithmic 
behavior in the cross section. 

The most precise way to determine the properly resummed expressions is to use renor- 
malization group evolution in an effective field theory setup to determine the partonic cross 
sections. However, one can use the fact that parton showers do sum the leading-logarithmic 
behavior correctly to derive expressions at this order. Just as in the toy model, we can 
Sudakov-improve the matrix element by finding a suitable merging of Eqs. 



^ and (g). 

Since there are two event samples describing the single gluon emission, care has to be 
taken to minimize the effect of large log^(ta/tb) logarithms. The simplest way to do so is to 
find an expression that reduces to Eq. (|82D if we were to take -R(ta, h) = Qai^a, tb) + Qbi'ta, tb), 
which guarantees the Sudakov-improved result will have the same singularity structure as 
the leading- logarithmic result. We choose 



Sudakov-improved (LO/LL): 



0^2 (/i^) 

dasajta) 
dtadtb 

dasbitb) 
dtndth 



fi|A^(l,/i2) + 

O^SCP Rjtq, tb) 

271 tatb 
C^sCp Rjtg, tb) 

2tt t„th 



2ti 



dta dtb 

2. tatb 



Qb 



Qa + Qb 



/^nihtb 



dtadtb-^ Alii, tb 



(86) 
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where we have omitted the dependence of the sphtting functions on the phase space variables. 
The Sudakov factors in dcr^ are weighted by the relative splitting probabilities of their 
respective shower history, which is equivalent to the approach of Ref. By integrating 
these expressions and expanding around small values of /t^, one can easily verify that the 
leading- logarithmic dependence on /t^ has vanished/^ and to leading order in one finds 



a = B 



l + ^(5_121og2) + OK2) 



(87) 



Note that the Sudakov-improved and tree-level results for das agree to leading order in a.^- 
Eq. (p6D is by no means unique. For example, the dominant shower history used in the 



CKKW prescription is morally equivalent to 

Dominant History Sudakov-improved (LO/LL): 



a2(A')=i?Aj(l,/i2), 

dcr^aita) asCp „ Rita,tb) 
— 1 1 ~ ^ 

dtadtb 2ti tah 

da3b{tb) UsCf „ R{ta,tb) 
£3 



The reason that this expressions works is that 

R{ta, tb) 9{tb > ta) — Qa(ta, h) 



d{tb>ta)^l{l,ta), 



finite as tn 



0. 



(88) 



(89) 



The lack of the 2 integrations in (J2 can contribute at most at the subleading-logarithmic 
level. As expected, there is no leading-logarithmic dependence in the total cross section for 
the dominant history method^^ 



o = B 



1 + ^(5- — -121og2)+0(a 



(90) 



E. Partonic Cross Sections at NLO 

We now want to include the virtual corrections to the 2-jet rate to obtain partonic ex- 
pressions that give both differential distributions as well as the total cross section correct 
to 0[oL^. As in the toy model, we will slowly build up towards an answer that has all 
the desired properties. Ignoring the leading logarithmic dependence of the partonic cross 
sections, we can use an NLO slicing method [j73|, |73|, |7^, |7^, 

NLO Slicing (NLO): 



The fact that the subleading single-logarithmic terms a^log/i vanished as well is an accident, and will 
not in general be true for the Q splitting functions or for gluon splitting functions. 
^® Again, the lack of subleading single-logarithmic terms as log p, is an accident. 
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dtadtb 



OisCp R{ta, tb) 



27r 



where 



B 



1 + 



2t[ e^O 



lim — 



dta dth 



2 + 2a 



(91) 



(92) 



Note the cancellation between the terms that are divergent as e ^ 0, such that is a finite 
expression, which only depends on /t^. It is a simple exercise to show that the total cross 
section reproduces the NLO cross section in Eq. ([751). As for the LO calculation, we do not 
have enough information to separately determine da^a and da^i, at this point. 

While Eq. (pi]) does reproduce both the differential and total cross sections correctly to 
0{as), it does not sum logarithms and has large fi"^ dependence. The leading logarithmic 
resummation can be included employing a subtraction method similar to MC@NLO ^ 
and we find 

NLO Subtraction (NLO/LL): 



a2(/i^)=i?(/i^) A^(1,A') + 



+ 



dcTa 



dtadtb 



271 

27r 



dt„dt.^^^^^A3,(l,4)+ / dt.dt, 



2a ^atb J 2b 

Rita, tb) — Qaita, tb) — Qb(ta, tb) 
tai tb 



B 



Qbita, ^fe) A 2 /I , \ 
— ^Q[i-,tb) 



+ Bif^^) (M^LllA AJ(1, ta) + AJ(1, tb) 



t. 



ai^b 



where 



B 



1 + 



2tt 



dta dtb, 



Rjta, tb) — Qajta, tb) — Qbjtg, tb) 
tatb 



(93) 



(94) 



Again, this reproduces the correct differential 3-jet distributions at 0{as) as well as the total 
NLO cross section of Eq. (^). Eq. ( p3D only defines da 2, 1 {dtadtb), but not the individual 
da ^i/ {dtadtb). To avoid large \o^{ta/tb) logarithms, we have many options. The simplest is 
to use Eq. ([70|) to define da^a and dasb via 



"a 



Qa 



ab 



Qb 



Qa + Qb Qa + Qb 

To see that this works, imagine that R = Qa + Qb, and note that 

Qa 



Qa + Qb 



1 as tn 







(95) 



(96) 



which shows that the singularity structure will correspond to the right leading-logarithmic 
behavior. In fact, Eq. (|95|) is a general strategy for minimizing the effect of leading- 
logarithms between intermediate scales, and we will use a variant of this approach in the 
next section. 
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As was true in the case of the toy model, the particular form of the NLO/LL partonic cross 
section in Eq. (p^) is not unique. The criteria we wanted to satisfy is the proper resummation 
of the leading double-logarithms while simultaneously reproducing all observables to 0{as) 
accuracy. As in the toy model, we can get a simple form for an NLO/LL partonic cross 
sections in analogy to Ref. [^ : 

NLO Elegant (NLO/LL): 

0-2 (/i^) = ctnlo , 

das asCp T{ta,h) 

- o-NLO — — — AT[mm(4, 4)J , (97) 



where ctnlo is given in Eq. (ffSl). The function T(ta,tb) is defined by 



T{ta,h) = Rita,h), (98) 

0"NLO 



and has an accompanying "Sudakov factor 



)5 



Axit) = exp 



2vr 73(4) tatf, 



(99) 



where 3{t) = d$3(t) denotes the region of phase space with ta,tb > t. Note that Axit) has 
the same double- logarithmic dependence as AQ(l,t), ensuring that the leading-logarithmic 
dependence is reproduced correctly. To order a^, we are also clearly reproducing both the 
differential spectrum and the total rate. As before, da^a and da^b can be defined using 
Eqs. (|7y) and ( p5| ) to minimize the effect of \og^ (ta/tb) logarithms. Eq. (|97D will be the 
cornerstone for the NLO/LO/LL result we will present in the next section. 

In our entire discussion above, we never talked about angular dependence. To keep the 
discussion simple, we have implicitly assumed that the angular dependence factorizes, but 
we do have to be mindful that in general, operator mixing can break this factorization. In 
any case, we can easily generalize Eq. ( ^7\) to have angular dependence by 

NLO Elegant with Angles (NLO/LL): 



d(72 (/t^ 



dn-dtAW = ^ ^ Ub ^"f""^'- ''^^ ' 

where the normalized /(f^f^) function contains angular information about 2-jet distributions 
and we have introduced the generalized real emission function R(ta,tb) R{Q2 y'^ayh, <P)- 
There are two different ways of defining the function T that appears in the Sudakov factor, 

T{nlt^,tbA) = ^R{^lt^,tbA) or _^ mit t (lo^L) 

c"nlo o"nlo J[^h) 

In either case, the Sudakov factor is given by 



Ay(t) = exp 



T{nl,ta,tb,(l)) 



(102) 



dt„dU I dm 

'3{t) 

The first choice is easier to implement and is the strategy adopted in this paper. The second 
option is likely to be more correct, but the difference can at most be at the subleading- 
logarithmic level. 
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5. NLO/LO/LL MERGING 



For the first emission in QCD, we could understand everything analytically, but the 
calculations quickly become unwieldy with more than one emission. Both for simplicity of 
discussion and simplicity of the GenEvA strategy, it is necessary to define more generally how 
to construct an LO/LL sample and how to merge together nested descriptions of QCD. 

As we saw in the previous section, in order to properly treat the double-logarithms of 
ta/tb, we needed two event samples, dMC3(ta) and dyiC^itb). Since the parton shower has 
two possible histories to generate the 3-parton final state {q qg or q qg splitting), these 
two event samples can be interpreted as representing these two possible shower histories. 
This will generalize for more partonic final states, and we will have a master formula which 
contains different event samples for each possible shower history, in the process giving an 
unambiguous definition for the phase space projection in Eq. (^. This will allow us to write 
the NLO/LO/LL sample by combining the NLO/LL, LO/LL, and LL descriptions, just as 
in the toy model. 

A. Generalized Master Formula 

To build the generalized master formula, we first introduce the concept of a would- 
be shower history. Though phase space with a matching scale does not require us to ever 
talk about parton shower histories, they are convenient for resolving logarithmic ambiguities 
since shower histories track QCD singularities, so one should really think of these histories as 
"singularity histories" . All of the possible leading-logarithmic ambiguities can be captured 
by introducing a separate event sample for every kind of allowed shower (or singularity) 
history. This gives the generalized master formula: 

d-=EE^^7^dMC„(;.„,), (103) 

n=2 J 

where j labels different shower histories for n final states, and we start running the phe- 
nomeno logical shower at the scale finj for each sample. Note that, similarly to Eq. (|68|), 
Eq. (|103D does not introduce double-counting, but merely reflects the scale ambiguity of 
QCD, and we anticipate defining dcr„j in analogy with Eq. (|69|) . 

As anticipated in Sec. P Q , dMC„(/x„j) is defined by excising regions of n-body phase 
space that are covered by running the phenomenological model on (m < n)-body phase 
space starting from the various fimj scales. With so many matching scales, it is convenient 
to choose finj equal to a common scale fi, except for /^nmaxj) which has to be equal to a 
scale "tj" in order to cover all of phase space without as\og^{tj/tk) ambiguities. Because 
scale-dependent ra-body phase space does not depend on there is no need for special 
treatment of the rimax-body phase space. Assuming that the phenomenological model is a 
virtuality-ordered shower, the excised phase space is 

d<l>„(;U) = d<^n{PCM;Pl,P2, ■ ■ ■ ,Pn)Y[0[{Pi + Pj? > f^^] , (104) 

ioj 

where ioj indicates pairs of partons that are associated with a QCD singularity. 

Since we are choosing all of the ;Un<nmax scales to be the same, the master formula in 
Eq. ( |103| ) is highly redundant for n < rimax, because many of the dMC„(/i„j) event samples 
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are multiply covered. Despite this redundancy, keeping the notion of a would-be shower 
history is convenient for the purposes of having a definition of phase space projection as 
in Eq. (||). That is, a shower history defines a map between n-body phase space and a 
lower-dimensional m-body phase space 

{$„, (nj)} {^rn, (mk)} for m < n , (105) 

where the specifics of this map are determined by the details of a showering or clustering 
procedure. Note that this map does not necessarily give a unique map $m, but rather 

gives a series of different consistent maps depending which shower history j is being selected. 
One could choose to implement a unique map — $m by only considering one type of 
shower history for a given phase space point. 

Given a phase space point with a would-be shower history (nj)}, we can always 
define the scale tj that we would need to start the phenomenological shower from to eliminate 
leading-logarithmic dependence. To do so, we simply ask at what scale the phenomenological 
shower would have generated the n-th branching. This is the scale we use to define /Xnmaxi) 
and it guarantees that there are no phase space dead zones. 



B. Sudakov Improvements 

We now wish to extend the LO/LL result of Eq. (^) by adding additional tree-level 
emissions. As discussed above, an LO/LL result is obtained by supplementing a tree- level 
matrix element with appropriate Sudakov factors. That is, we wish to combine the n-body 
tree-level matrix element (LO) 

d^LO 

that has no notion of a matching scale /i, with Sudakov factors taken from the equivalent 
partonic cross section for a set of n-body parton shower histories (LL) 

d$n 

where j labels the different shower histories for n final states. 

Note that schematically, the expression for the LL "calculation" can be written as a 
product of splitting functions and Sudakov factors, 

'I' ' = i? n Q^,r n = B g, A, (/..,) , (108) 

r=l s=l 

where B is the initial hard scattering matrix element, r labels the different 1 — > 2 splitting 
function vertices, and s labels the partons in the shower history. For convenience, we have 
introduced the notation 

n-2 2n-2 

Qj = n <5i,r- , Aj(/inj) = n ' ^^09) 

r=l s=l 

such that for the j-th shower history, Qj is the product of all splitting functions and Aj{finj) 
is the product of all Sudakov factors. Since the parton shower correctly reproduces the 



43 



singularity structure of QCD, the tree-level calculation should share the same singularity 
structure as the sum over all shower histories 



lim 

<I>„-^sing. 



finite. (110) 



d$„ 

Accounting for these singularities, one possible Sudakov-improved partonic cross section 

is 

Sudakov-improved (LO/LL): 



d$„ d$„ V ^ J d$„ 



(111) 



Expanding this result to leading order in a^, all of the A factors in dcr^j" are equal to unity, 

so summing over all of the da^^^^^^ expressions yields the tree-level result as desired. By 
construction, this answer has the correct leading-logarithmic behavior because it has the 
same singularity structure as the parton shower. 



Eq. ([ml) is a correct LO/LL answer, but there are various choices we can make for da; 



LL 

nj ■ 

The first is to use the naive LL cross section from Eq. (|108|) , which includes the sum over 
all possible shower histories. One can simplify this result, by following the path taken by 
CKKW ||2^ , and use the dominant shower history, defined as the one with the largest value 
of Qj. In QCD, there is never an ambiguity as to which shower history is dominant, and 
the dominant shower history always reproduces the QCD singularities. Thus, 

Qdom -I Qothcr q (112) 



Taking these limits as equalities, the CKKW procedure essentially defines: 
Dominant History Sudakov-improved (LO/LL): 

da^f^'-'^ifinj) _ j ^A,(/x„,) J = dom, ^^^^^ 



d$„ 



j = other. 



Eqs. ( |111|) and (|113|) share the same singularity structure, so they are both valid LO/LL 
results. 

A different route is to try to improve on Eq. ( |108|) . As we saw in Eq. (|82D, there are 
additional n integrations that are necessary to get the total cross section correct in the parton 
shower. From the shower point of view, these integrations correspond to situations where 
the parton shower populates a region of phase space that has QCD singularities but uses 
a splitting function away from the singular region. That is, two partons are "accidentally" 
distributed closer than the cutoff of the shower. Schematically, we can write this as 

d<(M^^/ r g.A.V (114) 



d$. 



These n regions do not exist as separate event samples, but their effect should be captured 
in the n-body event sample. 
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At this point, it is not at all clear how to figure out which lik integrations correspond to 
which shower histories j. At the leading-logarithmic level, these n integrations can simply 
be ignored because they are not associated with the dominant QCD singularities and give at 
most subleading-logarithmic corrections. However, for high enough multiplicity, the phase 
space volume of all the n regions is rather large, especially for moderate /i, so numerically 
it is dangerous to ignore them. On the other hand, if the only reason we are keeping the n 
integrations is for overall normalization, then it would be simpler to just introduce ad hoc 
/c-factors to restore the normalization. 

As discussed in the companion paper |I| , there is a numerically efficient way to accomplish 
the n integrations, so the user can address the n subtleties if desired. In particular, the 
GenEvA algorithm has a way to figure out which "wrong" histories k should be associated 
with which "right" history j, and it does this in a way that is reasonably efficient. Because 
the resulting differential distributions using Eqs. ( |108| ) and ( |114| ) are different, the two 
different choices can be used to test for systematics in the LO/LL sample. Note that for 
n = 3, using Eq. (|1 1 1| ) with Eq. ( |114| ) reproduces the result of Eq. (0). To show our results 
in Sec. H, we will always use Eq. ( |114| ) because of the reduced cross section scale dependence. 



C. Merging Nested Descriptions 



In the case of LO/LL merging in Eq. ( |111| ) or NLO/LL merging in Eq. (|97D , we were trying 



to merge together two different descriptions of the physics that both give valid predictions 
at some scale /i. More generally, we are interested in cases like in Fig. 13 where there is one 
description that gives a better description of the physics at some high scale and one that 
gives a better description at a lower scale. To build an NLO/LO/LL merged sample, we 
want to use an NLO/LL result at high energies, but supplement it with additional LO/LL 
information for subsequent emissions. 

From the toy model in Eq. ( ^5]) we learned that the NLO/LO/LL result for the par- 
tonic results with more than three partons are given by the LO/LL result, except that the 
first Sudakov factor changed to At- How do we obtain this new Sudakov factor, which 
depends on the kinematics of the first emissions? Consider the differences between the da^ 
expressions for the LO/LL and NLO/LL merged samples in Eqs. (|86|) and (pTj). We will use 
the convention of Eq. (|95|) to define cr^a'^'^^^ NLO/LL result. The ratio of the two 

answers is 

^^NLO/LL^^^.^ / ^^LO/LL^^^^ ^ Ar[min(ta, t;,)] 

dtadh I dtadh A^(l,4) ' ' 

and therefore can be used to extract the Sudakov factor At from the NLO/LL calculation.^^ 
While Eq. ( |115| ) is a function of 3-body phase space, we want to apply this correction 
factor to n-body matrix elements. This is straightforward, using the map from Eq. ( |105[ ) to 
define the map 

{<l>„,(nj)}^{$3,(3A;)} forn > 3 . (116) 



The fact that the numerator of Eq. (115) is a function of min(ta,if,) instead of ia means that there are 
power suppressed logarithmic ambiguities of the form (ia/ib) log(ia/i;)), which are beyond the order we 
are working. 
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Note that this imphes that the specific value of the correction factor will depend on the 
particular would-be shower history and on the details of the $3 map. By assumption, 

the map in Eq. ( |105| ) respects QCD singularities, so any differences are formally beyond the 
order we are working to, but different choices will affect the specifics of the NLO/LO/LL 
merging. 

Putting these pieces together, we find an expression for the best partonic calculation that 
GenEvA can currently implement 

GenEvA Best (NLO/LO/LL): 



d$2 

^^NLO/LO/LL 



d$.q 



dcr. 



NLO/LL 



(/i3j) dcT; 



d<l>2 

LO/LL 
3i 



d$3 



(/isj) dcr. 
X 



NLO/LL/, 
3j I'^i. 



d$3 



■da5;«/^^ 



it,) 



da. 



NLO/LO/LL 
nj 



njj 



d<f,„ 



LO/LL 




;ii7) 



where the value of rimax is determined by the availability of high-multiplicity tree-level matrix 
elements, and {3k) is determined uniquely from {nj) by Eq. (|116|) . Note that Eq. (|115D is 
still needed to define da^j. This is analogous to Eq. (|55D , where an additional Sudakov factor 
Aq was needed once extra emissions were added. The reason is, that the NLO/LL sample 
had rimax = 3 and therefore took fi3j=tj, whereas in general the matching scale fi^j will be 
lower than tj. Taking rimax = 3 and fi^j = tj, Eq. (|117|) reduces to Eq. (|97|) . 

As in the toy model, Eq. ( |117| ) is a special case of a more general construction to merge to- 
gether nested descriptions of QCD. In analogy with Eq. (^), if we have partonic descriptions 
A, B, C, . . ., then these can be consistently merged via 

Best Combination (A/B/C/---): 



dcr^j^^^^"'{Hnj) _ da:^j{finj) 



d^r 



d^r 



X 



d<l>. 



d^n' I d$,„/ 



d<l>„" 



118) 



This merging procedure does not spare the user from the need to first merge N*LO and N-^LL 
descriptions that occupy the same n-body phase space. Rather, Eq. (|118|) gives the user one 
simple option how to supplement low-scale/high- multiplicity calculations with additional 
high-scale/low- multiplicity information. 



6. RESULTS 

In this section, we present the results obtained using the GenEvA program, which imple- 
ments the GenEvA framework, as explained in Sec. P F|. As such, it uses the GenEvA algorithm 
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Notation 


Given by Eq. 


Description 


LOn 


dH) 


Tree-Level (LO) 


LO„_/LL 


(|mD with (|ii|) 


Sudakov-Improved (LO/LL) 


NLO2/LO3/LL 


m 


NLO Elegant (NLO/LL) 


NL02/L0„_/LL 


m 


GenEvA Best (NLO/LO/LL) 



TABLE IL Notation and meaning for the various event samples. LOn^^ax corresponds to tree- 
level matrix elements with 2- through nmax-emissions, and LL indicates the inclusion of leading- 
logarithmic information. In this work, we only consider virtual diagrams for 2-parton final states, 
denoted by NLO2. Note that the 3-parton matrix elements in the LO3/LL and NLO2/LO3/LL 
samples are not the same, as Eqs. (^) and ( p7[ ) include different Sudakov factors. LOn^iax/LL 
and NLO2/LO3/LL provide implementations of PS/ME merging and PS/NLO merging, respec- 
tively. The NL02/L0nnjax/LL sample is the best partonic calculation implemented by GenEvA, and 
simultaneously achieves PS/ME merging and PS/NLO merging. 



to generate Lorentz-invariant phase space, and then reweights the resulting events to the 
distributions discussed in this work. The current implementation of GenEvA is capable of 
describing e^e~ n jets, but only describes massless quarks adequately. Therefore, we 
do not present results for the production of 6-quark jets. A validation of the GenEvA event 
generator is given in the companion paper f^. 



For the fixed-order tree-level matrix elements we use the HELAS |^0| Fortran routines 



generated by Madgraph [^], and GenEvA currently utilizes matrix elements with up to n-^ 



6 final state partons. Final states with more than nmax partons are obtained by a subsequent 
parton shower, and in general any virtuality-ordered shower could be used for that purpose. 
For simplicity, here we only use the internal analytic parton shower of the GenEvA algorithm, 
which is a virtuality-ordered shower that neither includes color coherence, running, nor 
the hadronization of the final state. For this reason, the results presented here should be 
viewed as a proof of concept, and not as a prediction of the shown distributions. Unless 
otherwise noted, all results in this section use 

EcM= 1000GeV, /i = 50GeV, AiR = 10GeV. (119) 

As we will see in Fig. |T3], the matching scale /i = 50 GeV is rather low in the sense that 
the single-logarithmic dependence that we do not account for becomes important. How- 
ever, we choose a low scale in order to accentuate the differences between different partonic 
calculations and leave enough phase space volume available for high-multiplicity partonic 
states. 

The notation for the various event samples used in this section is summarized in Table ||. 
We use the notation LO^^^^^ for tree-level matrix elements with 2 <n < rimax partons in the 
final state, and LL indicates that leading-logarithmic Sudakov resummation is being used. 
Unlike in the companion paper [Q], we always use fully merged samples that give a complete 
covering of phase space. For example, LO4 includes 2-, 3-, and 4-parton tree-level matrix 
elements and the remaining phase space is covered by the internal parton shower. The same 
is true for LO4/LL, except that the matrix elements are now Sudakov-improved according 
to Eq. ( |111|) . Note that even without the Sudakov improvement all divergences in LO^ are 
regulated by the matching scale /x. Note also, that since GenEvA has no dead zones, the 
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FIG. 15: Dependence of the total cross section on the matching sca-le /i for TT/max — 

3 (left panel) 

and njnax = 6 (right panel) for 20 GeV < fi < 1000 GeV. The horizontal line corresponds to (Tnlo- 



The markers show the generated results and the solid lines the fit to Eq. (120). For the LO and 
LO/LL samples a{EcM) = clO) and for the NLO samples (j(-E'cm) = cnlo- Lowering fi, higher- 
multiplicity matrix elements are used in place of the parton shower to generate further emissions, 
causing o"(/x) for the LO samples to scale like Oslog^/i. With a proper LO/LL merging, the scale 
dependence reduces to log fi. Including NLO2 information further reduces it to {og log ^)^ for 
NLO2/LO6/LL (right panel), while for NLO2/LO3/LL (left panel) the cross section becomes 
independent and identical to (Tnlo by construction. 



LO2 sample is identical to LO2/LL, because when only 2 — 2 matrix elements are used the 
parton shower always starts running at the center-of-mass energy. The matching scale fi can 
also affect the meaning of nmax- For example, an LO4/LL sample with fi = Ecu would be 
identical to an LO2/LL sample because in this case fi would be too high to allow for any 
additional emissions above fi. 

For one-loop corrected matrix elements with n < rimax final state partons we write 
NLO„j^^^, and we currently only use NLO2. For example, an NLO2/LL result corresponds to 
an LO2/LL sample supplemented with an appropriate NLO fc-factor, while NLO2/LO3/LL 
corresponds to fully consistent 0{as) results, which contain all 0{as) corrections from 
both virtual 2-parton and real-emission 3-parton diagrams, and including leading-log re- 
summation according to Eq. (pT]). Note that the 3-parton matrix elements in the LO3/LL 
and NLO2/LO3/LL samples include different Sudakov factors, as can be seen comparing 
Eqs. (p6| ) and (0). An NLO2/LO4/LL sample additionally incorporates 4-parton tree- level 
matrix elements following Eq. ( |117D , and similarly for NL02/L0„j^^^/LL. 



A. Total Cross Section Scale Dependence 



We begin by presenting the dependence of the total cross section on the matching scale fi. 
On the left panel of Fig. |T5| we show the results for nmax = 3 and on the right panel for nmax = 
6. At fi = EcM, the total cross section is given by a{EcM) = o"lo for the LO samples and 
a^Ecu) = o"nlo for the NLO samples. As the matching scale fi is lowered, higher-multiplicity 
matrix elements are used in place of the parton shower to generate additional emissions. 
Using only tree-level matrix elements, we expect a double-logarithmic fi dependence as log^ fi 
for the LO samples, but only a single-logarithmic dependence as log fi for the Sudakov- 
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LO3 


LO3/LL 


NLO2/LO3/LL 




LOg/LL 


NLO2/LO6/LL 




0.713 


0.893 


-0.037 


7.40 


1.203 


-0.043 


0-2 


0.840 


0.115 


-0.006 


2.74 


0.056 


-0.151 



TABLE III: Coefficients of the single-logarithmic (first row) and double-logarithmic (second row) 
dependence of the total cross section in Fig. |l^. Shown is a fit to a{jl) = ao + ^(ai log fi+a2 log^ fx) 
in the range 0.02 < jl < 0.2, where fi = fi/EcM and Eqm = 1000 GeV. 



improved LO/LL samples. For the NL02/L0„,^^^/LL samples, the full dependence is 
included, and the fi dependence is thus expected to only start at (a^log/i)^. For rimax = 3, 
this term is absent because the total cross section is /i-independent by construction. We can 



clearly see in Fig. |15] that the scale dependence is reduced as we go from LO to LO/LL to 
NLO/LO/LL. 

To check the expected scaling with /i explicitly, we fit the result for the total cross section 
obtained by GenEvA to the function 

(j(/t) = ao H — ^(ai log/t + 02 log^ /i) , where jl = . (120) 

This functional form neglects any contributions from power corrections of the form /i", 
which dominate for /t ~ 1 so we only use the range fi < 0.2 in the fit. The results of 



the fit are shown by the solid lines in Fig. |15], and the fitted parameters ai and 02 are 



given in Table |T|. As anticipated, the LO3 result has both single- and double-logarithmic 
IX dependence. Taking into account the overall factors in Eq. (|120|) the values of ai and 02 
for the LO3 sample are consistent with Eq. (^5)). The LO3/LL result, has a much smaller 
02, indicating that the log^/t term only starts at 0{al)^ i.e. is a single-logarithmic effect, 
while Oi is of comparable size as for the LO3 sample. Thus, as expected, LO3/LL still has 
a single-logarithmic dependence at 0{as). For NLO2/LO3/LL both ai and 02 are small, 
indicating that there are no logarithmic terms present at 0{as). 

For the samples with rimax = 6, the cancellation of the dependence becomes more 
dramatic. The LOe sample shows a much larger /i dependence than LO3 due to the ac- 
cumulated /i dependence of all matrix elements. Nevertheless, this large /i dependence 
cancels in the Sudakov-improved LOg/LL, which has similar coefficients to LO3/LL. Fi- 
nally, for NLO2/LO6/LL the value of 02 is consistent with a residual dependence of the 
size (a^log/i)^, as expected. 



B. Interpolating Kinematic Extremes 

Next, we study how differential distributions differ in the various implementations in- 
cluded in this work. We find the so-called C parameter ||82| to be a particularly useful 
observable to highlight these effects. Given the linear sphericity tensor |^ 

^ ^ |P.| (121) 

constructed out of the final state three- momenta pj, the C-parameter is defined as 

C = 3(AiA2 + A2A3 + A3Ai) , (122) 
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FIG. 16: PS/ME merging as implemented in GenEvA (LO/LL) with ^CM = 1000 GeV and /i = 
50GeV. The C parameter measures the "jettiness" of an event, with C ~ giving the 2-jet region, 
C < 0.75 roughly giving the 3-jet region, and C > 0.75 roughly giving the 4-jet region. Left panel: 
The LO3 sample has the soft-collinear divergence of the 3-parton tree-level matrix element and 
therefore becomes singular near C = 0, while the LO3/LL sample regulates that divergence with a 
Sudakov factor similar to LO2/LL. At large C, the LO2/LL sample lacks the quantum interference 
of the 3-parton matrix element, while the LO3/LL sample contains the correct interference effects 
and therefore tracks LO3. Right panel: The same interpolation for LO4/LL, which captures the 
correct Sudakov-suppression of the LO3/LL sample near C ~ and at the same time includes the 
extra interference effects of LO4 near C ~ 1. 



where Aj are the eigenvalues of S"*^. Since Aj = 1 we have < C < 1. Low values of the 
C parameter are dominated by events which are mostly 2-jet-like, while larger values of C 
indicate more and more final state jets. For planar events A3 = forcing C < 0.75, so that 
3-jet events are confined to C < 0.75, while events with four or more jets can contribute up 
to C = 1. 

We begin by studying the effect of Sudakov resummation on the tree-level matrix ele- 
ments, which is the GenEvA analog of PS/ME merging. It is well known that the leading- 
logarithmic resummation for the C parameter reduces the cross section for small values of 
C, with the resummed expressions given in Ref. ||S^. It is also well known that the pure 
parton shower result overshoots the correct QCD result for large values of the C param- 
eter, since the interference between the two QCD diagrams contributing to the emission 
of a single gluon is destructive. On the left panel of Fig. we compare the results for 
the C-parameter obtained by running the pure parton shower (LO2/LL), by using tree- level 
QCD matrix elements up to order (LO3), and the merged result with n^ax = 3 (LO3/LL). 
We can see clearly how the LO2/LL sample overshoots the LO3 result for large values of 
C, while it suppresses the cross section for small values of C. The merged LO3/LL result 
reproduces the result of the pure QCD matrix element for large values of C, where the 
leading logarithms are not important. The resummation of the double logarithmic terms 
becomes important for small values of C, and we can see the Sudakov suppression of the 
merged result compared to the pure matrix element for small values of C. 

Note that the merged result has to approach the pure parton shower result for C 0, 
and we have checked that this is indeed the case. The reason this is not obvious from the 
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FIG. 17: PS/NLO merging as implemented in GenEvA (NLO/LL). The NLO2/LO3/LL, LO3, and 
LO3/LL samples all share the same interference effects near C = 1, given by the 3-parton tree-level 
matrix element. Near C = 0, LO3 is too singular, while LO3/LL is too Sudakov suppressed. The 
NLO2/LO3/LL sample has the correct NLO cross section information and therefore has the right 
normalization near C = 0. Since the inclusion of NLO cross-section information affects the form 
of the Sudakov factor for the NLO/LL sample, the NLO2/LO3/LL sample is not just a fc- factor 
modification of the tree-level resummed LO3/LL result. 



figure is that the binning is too coarse for this effect to be visible, and the agreement does not 
happen until much smaller values of C.^° To see more easily that the desired interpolation 
does occur, we include one additional tree-level emission, and compare the results for LO4, 
LO3/LL and LO4/LL on the right panel of Fig. ITBl One can now clearly see that the LO4/LL 
sample reproduces the LO3/LL result for small values of C. The agreement between the 
LO4/LL and the LO4 sample for large values of C is not expected to be perfect, since the 
Sudakov suppression has some effect all the way up to C = 1. However, we do see that the 
agreement is being obtained asymptotically. 

Next, we show the effect of including NLO information, which is GenEvA's analog 
of PS/NLO merging. As we saw in Fig. 15, the tree- level calculation contains double- 
logarithmic terms which results in a rising cross section as the scale fi is lowered. For the 
C parameter distribution, this implies that the tree-level result diverges as — log(C)/C for 
C — > 0. Resumming the leading- logarithmic terms removes this dominant singularity for 
small values of C, however we also see in Fig. |15| that the cross section of the resummed 
result undershoots the correct (NLO) result. In Fig. [1^, we compare the tree-level matrix 
elements LO3, the resummed matrix elements LO3/LL, as well as the resummed NLO result 
NLO2/LO3/LL. For large values of C all three results agree, as expected, because all three 
samples have the same interference terms from the tree- level 3-parton matrix element. How- 
ever, for C ^ 0, the NLO2/LO3/LL result has the correct 0{as) normalization, while LO3 
is too singular, and LO3/LL too Sudakov-suppressed. Note that the NLO2/LO3/LL result 
is not simply a fc-factor modification of the tree- level result, because as seen in Eqs. (|86D 



Because of the single- logarithmic effect of the n integrations, the LO3/LL sample first overshoots the 
LO2/LL sample until finally asymptoting from above. With the n integrations turned off, the agreement 
happens at more moderate values of C, but still smaller than our bin size. 
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C Parameter C Parameter 

FIG. 18: The combination of PS/ME merging and PS/NLO merging in GenEvA (NLO/LO/LL). 
Left panel: The LO4/LL sample is the analog of PS/ME merging and contains the correct interfer- 
ence of the 4-parton matrix element near C = 1, while the NLO2/LO3/LL sample is the analog of 
PS/NLO merging, and has the correct cross section information near C = 0. The NLO2/LO4/LL 
sample smoothly interpolates between the two different regimes. Right panel: The same interpo- 
lation for Umax = 6, where NLO2/LO6/LL smoothly interpolates between LOg/LL at large C and 
NLO2/LO3/LL at small C. The kinked behavior near C = 0.75 is a well-known physical effect 
p2| , unrelated to the merging procedure. It occurs because C < 0.75 for planar events, meaning 
that 2- and 3-parton matrix elements cannot contribute much for C > 0.75. The NLO2/LO6/LL 
sample is the best partonic calculation currently implemented in GenEvA. 



and (0), the Sudakov factors in the 2- and 3-parton matrix elements must be modified to 
get the correct NLO cross section, while still incorporating leading-logarithmic results. 

We now consider the NLO/LO/LL partonic calculation which combines NLO cross section 
information with higher-order tree-level matrix elements, all Sudakov- improved. We expect 
that in the 4-jet region (C > 0.75), the NLO2/LO4/LL sample will agree well with the 
LO4/LL result, since 3-parton states do not contribute much in this region. For small 
values of the C parameter, however, we expect the NLO2/LO4/LL result to be close to 
the NLO2/LO3/LL result, since the NLO cross section information is important there. The 
results are shown on the left panel of Fig. which confirms the expected interpolation. On 
the right panel of Fig. |18|, we give the result of the best partonic calculation current available 
in GenEvA: NLO2/LO6/LL. This sample includes additional interference terms from tree- level 
matrix elements with n < 6, while still maintaining NLO/LL accuracy. We clearly see that 
the NLO2/LO6/LL sample interpolates between the LOe/LL (PS/ME merged) sample in 
the interference region {C ~ 1) and the NLO2/LO3/LL (PS/NLO merged) sample in the 
total cross section region {C ~ 0). In other words, the best implementation available in 
GenEvA simultaneously achieves PS/ME merging and PS/NLO merging. 

Finally, on the left panel of Fig. |T^, we illustrate the composition of the NLO2/LO6/LL 
sample, showing the individual contributions of the n-parton matrix elements, where n 
corresponds to the number of partons resolvable at the scale /i. Near the 2-jet region 
{C ~ 0), the n = 2 matrix element dominates as expected. The n = 3 matrix element fills 
out the region < C < 0.3, and since it gives mostly planar events, it turns off abruptly 
at C = 0.75. Despite the fact that C < 0.75 is supposed to correspond to the 3-jet region, 
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FIG. 19: The five components of the GenEvA Best NLO2/LO6/LL sample. The curves for different 
n show the individual contributions of the n-parton matrix elements, where n corresponds to the 
number of partons resolvable at the scale /i. Left panel: /i = 50 GeV. As expected, the n = 2 
component dominates near C = 0. Because fi is so low, the n = 3 and n = A matrix elements 
both contribute in the 3-jet (C < 0.75) region, where the crossover point C = 0.3 is directly 
related to the scale Above C = 0.75, the n = 3 component can no longer contribute, and the 
rest of the C distribution is filled out with additional contributions from the 4-, 5-, and 6-parton 
matrix elements. Right panel: /i = 100 GeV. For this more reasonable jJL scale, the various n- 
jet-like regions correspond more closely to the n-parton matrix element used. While the total C 
parameter distribution is only single-logarithmically sensitive to changing the /x scale, the different 
matrix element components shift dramatically, with the n = 6 matrix element now having very 
little available phase space. 

the n = 4 matrix element dominates for 0.3 < C < 0.75, because = 50 GeV is so low, 
that the n = 4 matrix element is really being used to determine some of the jet substructure 
in the 3-jet region. Above C = 0.75, the n = 4 and n = 5 matrix elements give roughly 
equal contributions, with the region near C = 1 supplemented by the n = Q matrix element. 
This is not surprising as the C parameter no longer resolves the difference between 4-jet and 
higher-jet event shapes. On the right panel of Fig. 0, the same C parameter is shown for 
/i = 100 GeV. With this matching scale, the separation of the C parameter into n-jet-like 
regions roughly corresponds to the n-parton matrix elements. Note that the n = 6 matrix 
element now barely has phase space available to contribute to the total distribution, so the 
remaining matrix elements shift to compensate. 



More Results Prom GenEvA Best 



To get more intuition about the GenEvA Best NLO/LO/LL sample, it is helpful to use a 
jet algorithm to get more differential information than is available in an event shape measure 



like the C parameter. We use the Fast Jet package |8§] for that purpose, using the inclusive 
fc-r jet algorithm with R = 1.0 to identify jets, which are then ordered by their total energy. 



On the left panel of Fig. |2y, we show the invariant mass of the 2nd hardest jet. Events 
which are almost 2-jet-like contribute to small values of this observable, while events with 
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FIG. 20: Comparison of the GenEvA best sample NLO2/LO6/LL with the PS/ME merging analog 
LOg/LL and the PS/NLO merging analog NLO2/LO3/LL for two jet-based observables. Left panel: 
The invariant mass of the 2nd hardest jet, where we can clearly see the destructive interference in 
the matrix element calculation with n > 3 partons compared to the results where these additional 
partons are only generated by the parton shower. The NLO2/LO6/LL sample interpolates between 
the two comparison distributions, showing that GenEvA can capture important physical effects that 
cannot be seen by considering the NLO2/LO3/LL or LOe/LL samples alone. Right panel: The 
invariant mass between the 1st and 3rd hardest jets. Since no particular parton multiplicity 
dominates at large pairwise invariant masses, the three different samples have slightly different 
behaviors near the endpoint. 

well-separated partons contribute to larger values. As we saw in the previous section, adding 
additional tree-level matrix elements suppresses rates for well-separated partons due to in- 
terference effects in the full matrix-element calculations. On the other hand, the NLO 
information increases the cross section in regions of phase space which contribute mostly to 
2-jet-like events. Thus, we expect the NLO2/LO3/LL result to be higher than the LOg/LL 
sample in the whole kinematic region, which is seen in the figure. Combining these two 
results, we expect to reproduce the lower rate for well-separated partons, due to the inter- 
ference from the tree-level matrix elements, while at the same time reproducing the enhanced 
cross section for almost 2-jet-like events due to the NLO information. This is clearly seen in 
the NLO2/LO6/LL sample, which thus captures important physical effects that cannot be 
seen by considering the PS/ME merging or PS/NLO merging results alone. 

The right panel of Fig. [2^ shows the invariant mass between the 1st and 3rd hardest jets. 
Low values of this pairwise invariant mass correspond to the 2-jet-like region, and again 
the NLO2/LO6/LL sample inherits the NLO cross section information. As the pairwise 
invariant mass increases, this observable gets contributions from a variety of different event 
types, so the NLO2/LO6/LL sample tracks the shape of the LOg/LL result with an overall 
NLO cross section increase. At very large values of this invariant mass, all three curves give 
slightly different answers because this extreme kinematic region is sensitive to the exact way 
in which the jets are clustered, which is strongly affected by the exact ratios of the 4-, 5-, 
and 6-parton matrix elements. 

As seen in the C parameter, the NLO2/LO6/LL sample combines 5 different matrix 
elements, and it is interesting to see the individual matrix-element contributions for jet- 
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FIG. 21: The five components of tlie NLO2/LO6/LL sample for two jet-based observables. The 
curves for different n show the contributions of the individual n-parton matrix elements. Left 
panel: The invariant mass of the 4th hardest jet. Since GenEvA currently uses virtuality to define 
the scale /i that separates different parton-level multiplicities, the invariant masses of jets have 
sharp cutoffs for the different parton-multiplicity components, with the n = 4 sample turning off 
at the matching scale around 50 GeV as expected (the small tail comes from the jet algorithm 
"accidentally" clustering together two different quark flavors). Right panel: The invariant mass 
between the 2nd and 5th hardest jets. As this pairwise invariant mass increases, the dominant 
n-parton matrix element smoothly changes from n = 3 to n = 4 to n = 5. 



based observables. On the left panel of Fig. |2T|, we show the results for the invariant mass of 
the 4th hardest jet. Since our evolution variable is equal to the virtuality between partons, 
the parton shower can only generate virtualities below the staring scale of the shower, which 
is chosen as /i = 50 GeV. Thus we expect that a 4th jet with invariant mass above 50 GeV 
can only be generated by a partonic calculation with at least 5 partons in the final state. 
This effect is reproduced by GenEvA, and while there are sharp cutoffs in the individual 
contributions of n = 2,3,4-parton matrix elements, the combined NLO2/LO6/LL result is 
relatively smooth over the entire range of the invariant mass of the 4th hardest jet. Note 
that there is a slight kink in the final distribution at the matching scale 50 GeV, which gives 
a sense of the size of the subleading-logarithmic errors one makes in this merging. 

For the pairwise invariant mass between the 2nd and 5th hardest jets, shown on the 
right panel of Fig. pTl the individual partonic contributions do not cut off as sharply as for 
the previous case. The dominant matrix element smoothly changes from n = 3 for small 
invariant mass, to n = 4 for intermediate masses, to n = 5 towards the endpoint of the 
distribution. The total result is again a very smooth function over the entire range of the 
inter-jet invariant mass, with perhaps a slight kink at 175 GeV from the same n = 4 to n = 5 
transition that gave a kink in the previous distribution. 

As a final application, we study the scale dependence of a differential distribution for 
'^max = 4. We saw in Fig. O that the /i dependence of the total cross section was reduced 



There are n = 4 events that can have jet masses slightly larger than the matching scale, because the cutoff 
only applies to singularity-producing partons. One can see the effect of, say, the e+e~ uudd sample in 
the rare events that have a 4th jet with invariant mass upwards of 60 GeV. 
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FIG. 22: Comparison of the ^ dependence of the LO4, LO4/LL, and NLO2/LO4/LL samples. 
Shown is the distribution of the invariant mass between the 1st and 3rd hardest jets for five different 



values of the matching scale /u. Just as in Fig. 15, the inclusion of leading-logarithmic and NLO 



cross-section information reduces the dependence on the unphysical matching scale fi. At large 
invariant masses, the dominant reduction comes from including leading- logarithmic information in 
high-multiplicity matrix elements. At small invariant masses, there is an additional reduction in 
the n dependence from the inclusion of NLO cross-section information. 



when going from LO to LO/LL to NLO/LO/LL, and it is interesting to see the extent to 
which this reduced scale dependence is reflected in differential distributions. In Fig. we 
show the invariant mass between the 1st and 3rd hardest jets for these three samples. We 
see that the /i dependence is lowered going from LO4 sample (top left plot) to the LO4/LL 
sample (top right plot). We also see how the direction of the scale dependence is reversed, 
as expected from the scale dependence of the total cross section. In the bottom plot we 
show the result for the NLO2/LO4/LL sample. Including the NLO information does not 
change the scale dependence significantly for large values of the invariant mass, since this 
is the region where the real-emission diagrams are dominant. However, for small values of 
the invariant mass, i.e. in the 2-jet region, the scale dependence is reduced, mirroring the 
reduced scale dependence observed in the total cross section. The residual scale dependence 
is single logarithmic. 
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7. CONCLUSIONS 



We have presented a new framework for event generation, GenEvA, which allows almost 
any partonic calculation to be interfaced with a phenomenological model such as a par- 
ton shower. Because the phenomenological model is assumed to include a description of 
hadronization, GenEvA offers a method for inclusive partonic information to be used to pro- 
duce fully exclusive hadronic events. 

The main conceptual breakthrough is a definition of phase space with a matching scale. 
This matching scale cleanly separates partonic calculations performed in QCD from phe- 
nomenological models based on QCD. GenEvA avoids phase space double-counting by con- 
struction, because once GenEvA is interfaced with a parton shower, every point in pertur- 
bative phase space is covered once and only once. By appropriate inclusion of leading- 
logarithmic information in the partonic calculations, the dominant sensitivity to the un- 
physical scale which separates the partonic regime from the showering regime is removed. 

In this way, GenEvA allows theorists to focus on providing the best possible partonic 
calculations and not on the algorithmic details of implementing those calculations. While 
other solutions to the problems of double-counting and shower merging exist in the literature, 
they are tailored to specific partonic calculations. For example, though there are various 
methods to merge tree-level calculations with parton showers [^, ^ |2^, 

~ HI, 



28, 29 



30, 31, 32, 33, 34, 



usually entirely different procedures are necessary to merge NLO 

H. In 



a H ra, ra, 47 



calculations with parton showers [| 
contrast, GenEvA offers a generic solution to these problems, and the NLO/LO/LL merged 
sample provides an example of the kinds of improved partonic calculations that can be 
developed when algorithmic issues are separated from calculational ones. 

The name GenEvA is obviously inspired by the site of the upcoming LHC experiment at 
CERN, and we expect to generalize the GenEvA framework to deal with additional complica- 
tions present in a hadronic environment. We discuss some of these issues in our companion 
paper [Q], and we argue there that the complications are technical, not conceptual. Much 
of the legwork has already been done, as modern parton showers implement initial state ra- 
diation through backwards evolution, effectively defining the notion of a variable matching 
scale for hadronic collisions. 

From a theoretical point of view, the most interesting developments will be to implement 
more advanced kinds of partonic calculations. In this work, we only considered loop diagrams 
involving two final state particles, but there is a growing body of one-loop calculations 
with large numbers of final states (for a recent review see Ref. Because GenEvA is 

agnostic as to the method of regulating IR divergences in loop diagrams, the choice about 
whether to use a slicing method, a subtraction method, or an elegant method to supplement 
NLO calculations with leading-logarithmic information can be made on the basis of physics 



considerations alone. Also interesting will be the implementation of calculations |£l 
based on SCET 



57, 58 
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because it allows for a consistent treatment of subleading 
logarithms. 

From an experimental point of view, GenEvA offers a unique opportunity to assess Monte 
Carlo systematics, as a single GenEvA event can support multiple different theoretical dis- 
tributions, allowing theoretical errors to be probed without additional detector simulation 
time The NLO/LO/LL merged sample also gives a proof-of-concept that multiple dif- 
ferent theoretical distributions can coexist within a single Monte Carlo sample, allowing the 
experiments to use the best theoretical knowledge available for any given point in phase 
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space. With the possibihty for sub-GeV measurements of the top quark mass and aggressive 
use of Monte Carlo to extrapolate the Standard Model up to 14 TeV, we anticipate that the 
experimental collaborations will benefit from the flexibility and transparency of the GenEvA 
framework in the LHC era. 
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